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BORKA DISEASE VIRAL SEQUENCES , 
5 DIAGNOSTICS AND THERAPEUTICS FOR 

NERVOUS SYSTEM DISEASES 

This patent application is a continuation-in-part of 
U. S. patent application Serial No. 08/434, 831 , filed on 
10 May 4, 1995 , which is a continuation-in-part of U. S. 
patent application Serial No. 08/369,822, filed on 
January 6, 1995. 

This invention was made with Government support 
under Grant No. NS29425, awarded by the National 
15 Institutes of Health (NINDS) . The Government has certain 
rights in this invention. 



FIELD OF THE INVENTION 

The present invention relates to the field of 
20 virology, immunology, gene therapy, transplantation of 
viral transfected cells, and in vivo chemical delivery. 



BAS8B5SBBB OF TH3 invention 
Borna disease is an immune -mediated neurologic 

25 syndrome {Narayan, O., et al., Science 220:1401-1403 
(1983)} caused by infection with Borna disease virus 
(BDV) . BDV is a neurotropic, nonsegraented and negative - 
strand RNA virus that causes a progressive, immune- 
mediated neurologic disease characterized by disturbances 

30 in movement and behavior {Ludwig, H., et al., Prog. Med 
Virol, 35:107-151}. It causes fatal disease in expensive 
domestic animals. Although natural infection was 
originally considered to be restricted to horses and 
sheep in Southeastern Germany, recent studies suggest 

35 that BDV infects horses in North America {Kao, M. , et 
al., Vet Rec, 132:241-4 (1993)}, cats in Sweden {Lundgren, 
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A.-L., et al., Zbl. Vet. Med [BJ, 40:298-303 (1993)}, 
ostriches in Israel {Malkinson, M., et al., Vet. Rec, 
133:304 (1993)} and some human subjects with 
neuropsychiatric disorders in Europe and North America 
5 {Bode, L., et al., Arch. Virol. [Suppl] . 7:159-167 (1993) ; Bode, 
L., et al., Lancet, ii:689 (1988); Fu, Z. F., et al., J. 
Affect. Disorders, 27:61-68 (1993) and Rott, R. , et al., Science, 

228:755-756 (1985)}. 

Experimental infection in rats {Narayan, O., et al., 
10 Science, 220:1401-1403 (1983)} results in a multiphasic 
syndrome characterized by hyperactivity, stereotyped 
behaviors, dyskinesias and dystonias. 

Though natural infection has not been reported in 
primates, subhuman primates can be infected 
15 experimentally {Sprankel, H., et al., Med Microbiol. Immunol 
165:1-18 (1978) and Stitz, L. , et al . , J. Med. Virol. 6:333- 
340 (1980) } . Antibodies to BDV proteins have been found 
in patients with neuropsychiatric disorders {Rott, R. , et 
al., Science 228:755-756 (1985); Fu, Z. F., et al., J. Affective 
20 Disord. 27:61-68 (1993) and Bode, L., et al.. Arch Virol. 
(Suppl.) 7:159-167 (1993)}. 

Because BDV grows only to low titer, it was 
difficult to purify for analysis. However, the 
identification of BDV cDNA clones by subtractive 
25 hybridization {Lipkin, W. I., et al., Proc. Natl. Acad Sci. USA 
87:4184-4188 (1990) and VahdeWoude, S., et al., Science 
250:1276-1281 (1990)} and, more recently, the advent of 
a method for isolation of virus particles {Briese, T., et 
al., Proc. Natl. Acad Sci. USA 89:11486-11489 (1992)} led to 
30 partial characterization of BDV as a negative -strand RNA 
virus which transcribes its RNA in the cell nucleus 
{Briese, T. , et al., Proc. Natl. Acad. Sci. USA 89:11486-11489 

(1992) }. 

The diagnosis of BDV infection is based on the 
35 appearance of a clinical syndrome consistent with 
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disease, and the presence of serum antibodies that detect 
viral proteins in infected cells by indirect 
immnunofluorescent test (IFT) {Pauli, G., et al. # Zbl Vet. 
Med [B] 31:552-557 (1984)}, Western blot (WB; 
5 immunoblotting) or immunoprecipitation (IP) {Ludwig, H. , 
et al., Prog. Med Virol, 35:107-151 (1988)}. These methods 
are cumbersome and difficult to use for large surveys of 
human and livestock populations. 

10 SUMMARY OF THE INVENTION 

One aspect of the invention presents the nucleotide 
and amino acid sequences of Borna disease virus (BDV) , 
their derivatives, the vectors for expressing them, and 
cells transfected by these vectors. 

15 Another aspect of the invention presents novel BDV 

viral proteins gp!8 and p57 and their respective 
recombinant proteins, recpl8 and recp57. Also disclosed 
are their nucleotide and amino acid sequences, vectors 
encoding them, cells transfected by these vectors, and 

20 antibodies directed to these proteins. 

Another aspect of the invention presents assays for 
detecting ligands which bind BDV proteins or their 
derivatives. Preferably, these assays are immunoassays 
for detecting antibodies to BDV protein or its 

25 derivatives. The assays are useful for detecting: (1) 
BDV infection or related pathogenesis; and (2) neurologic 
and neuropsychiatric disease not due to BDV infection. 
Preferably, p40, p23 or gp!8, and their synthetic 
versions or fragments are used in these assays. The 

30 preferred immunoassays are enzyme -linked immunosorbent 
assays (ELISAs) based on the use of recombinant viral 
proteins: recp40, recp23, and/or recplB, and/or the 
immunoreactive fragments of the foregoing, to detect 
ligands, such as antibodies, in the patient's biological 

35 sample, that are immunoreactive with these proteins. The 
assay can also be used to monitor the diseases by 
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monitoring the titer of Buch ligands. The titer of the 
iigands can also be prognosticate of the diseases. 

Another aspect of the invention presents alternative 
methods for detecting the above diseases by detecting the 
hybridization of nucleotide sequences in a patient's 
biological sample with the nucleotide sequences coding 
for BDV protein or its derivatives. 

Another aspect of the invention presents assay kits 
for the above diagnostic tests. 

Another aspect of the invention presents vaccines 
against the above diseases. 

Another aspect of the invention presents synthetic 
peptides, based on truncated BDV protein, useful for 
immunoassays for detecting antibodies to BDV or for 
15 raising antibodies for the therapeutic uses described in 
the next paragraph. The method for obtaining these 
peptides are also presented. 

Another aspect of the invention presents methods, 
using ligands or chemicals such as antibodies, capable of 
20 binding to BDV proteins or their derivatives, for 
treating: (1) BDV infection or related pathogenesis; and 
(2) neurologic and neuropsychiatric disease not due to 
BDV infection. Examples of such antibodies are those 
specific to gpl8 and p57. Also presented are these 
therapeutic agents, methods for screening for them, 
especially those that bind to the immunogenic epitopes of 
BDV protein. The methods for producing the antibodies 
are also presented. 

Another aspect of the invention presents a BDV-based 
viral vector useful for in vivo delivery of genes and 
chemicals to the nervous system. Also disclosed are: the 
cells transfected by the viral vector and cell lines 
derived therefrom, the in vitro harvesting of the gene 
product from such cells and cell lines, and the 
35 transplant of such cells into animals. 

Other aspects and advantages of the invention will 
be apparent to those skilled in the art upon 



25 



30 
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consideration of the following detailed description which 
provides illustrations of the invention in its presently 
preferred embodiments. 

5 Brief Des cription of the Drawings 

FIG. 1 presents the genomic organization and 
transcriptional map of BDV. 

FIG. 2 shows the complete genomic sequence of BDV 
(strain V) in 5' to 3' cDNA with the deduced amino acid 
10 sequence shown below the cDNA. 

FIG. 3 (a) presents the organization of the BDV 
genome; (b) presents the coding potential of the genome. 

FIG. 4 shows alignment of the pl80 (also referred to 
as "pol") open reading frame (ORF) and negative- strand 
15 RNA virus L-polymerase amino acid sequences with PILEUP 
computer program (Sequence Analysis Software Package, 
Genetics Computer, Inc., Madison, Wisconsin). BDV 
sequence is indicated with double arrowheads. 
Rhabdoviridae : RaV, rabies virus; VSV, vesicular 
20 stomatitis virus; SYN, sonchus yellow net virus. 
Paramyxoviridae : MeV, measles virus; SeV, Sendai virus; 
NDV, Newcastle disease virus; RSV, respiratory syncytial 
virus. Filoviridae: MaV, Marburg virus. 

FIG. 5 presents sequence analysis of BDV genomic 
25 termini. (a) Similarity of 3 '-terminal BDV sequence to 
leader regions of Rhabdoviridae (RaV) , VSV) , 
Paramyxoviridae (MeV, SeV, NDV, RSV) , and Filoviridae 
(MaV); (b) Comparison of complementarity at 3' and 5' 
termini of BDV genomic RNA with that of four other 
30 nonsegmented, negative -strand RNA viruses. 

FIG. 6 presents the map of BDV subgenomic RNAs 
relative to the viral antigenome. (a) Northern 

hybridization analysis of rat brain poly (A) * RNA; (b) 
position of viral transcripts with respect to antigenome 
35 as determined by Northern hybridization and sequence 
analysis; (c) alignment of the seven potential 
termination sites of BDV. 
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FIG. 7 presents the sequence of ORF gplB. 
FIG. 8 shows glycan determination of gpl8. Lanes: 
0, protein detection by mouse anti-gplB serum; 1, ConA; 
2, wheat germ agglutinin; 3, D. stramonium agglutinin; 4, 
BS-I; 5, BS-II; 6, G. nivalis agglutinin; 7, S. nigra 
agglutinin; 8, M. amrensis agglutinin; 9, peanut agglutinin. 
Positions of molecular weight markers are shown in 
kilodaltons at the right. 

FIG. 9 presents treatment of gplB with buffer alone 
or endoglycosidase. Lanes: 1. buffer; 2, 

endoglycosidase F plus Af-glycosidase F; 3, 
endoglycosidase F (AT-glycosidase free); 4, endo-0- 
galactosidase. Positions of molecular weight markers are 
shown in kilodaltons at the right. 
15 FIG. 10 presents in vitro transcription, translation, 

and cotranslational processing of gpl8. (A) Lanes: 1, 
pBDV-23 RNA; 2, pBDV-23 RNA plus microsomal membranes; 3, 
pBDV-gpl8 RNA; 4, pBDV-gpl8 RNA plus microsomal 
membranes; 5, pBDV-gpl8 RNA plus microsomal membranes, 
20 incubated with endoglycosidases. (B) Lanes: 1, pBDV- 
gpl8 RNA; 2, pBDV-gpl8 RNA plus microsomal membranes; 3, 
pBDV-gpl8 RNA plus microsomal membranes, incubated with 
endoglycosidases . 

FIG. 11 presents Western blot analysis of native and 
25 recombinant proteins with monospecific antisera to 
recombinant proteins and sera from infected rats. (A) 
Lane 1, C6BDV lysate; lane 2, recp40; lane 3, recp23; 
lane 4, recplS; lane 5, C6 lysate; lane 6, recp40, recp23 
and recplS, Lanes 1-4 were treated with serum from 
30 infected rat; lanes 5 and 6 were treated with serum from 
noninfected rat. (B) C6BDV lysates (lanes 1-3) and C6 
lysates (lanes 4 and 5) were incubated with: lanes 1 and 
4, serum from infected rat; lane 2, anti-p40 rabbit 
serum; lane 3, anti-p23 rabbit serum; and lane 5, pooled 
35 anti-p40 and anti-p23 sera. 
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FIG. 12 presents ELISA of infected rat serum reacted 
with recp40. Circles , recp40 and serum from chronically 
infected rat; squares, recp40 and serum from noninfected 
rat; triangles, BSA and serum from chronically infected 
5 rat. 

FIG. 13 presents timecourse for the appearance of 
antibodies to BDV-proteins. (A) recp40; (B) recp23; and 
(C) recpl8. 

FIG. 14 presents timecourse for the appearance of 
10 antibodies to BDV proteins in sera from individual rats 
after intranasal infection. (A) Neutralization activity 
in sera from BDV- infected rats at three timepoints (5, 10 
and 15 weeks post -infect ion) . (B) Plot of mean recpl8 
ELISA titers (open columns) with neutralization titers 
15 (hatched columns) at three time points (5, 10 and 15 
weeks post -infection) . Sera analyzed were the same as 
those in panel A. (C) Timecourse for the appearance of 
antibodies to recp40, recp23, and gpl8 by Western blot 
analysis . 

20 FIG. 15 presents (A) Immunoprecipitation of gpl8 

with monoclonal antibodies (Mabs) . Lanes: 1, serum from 
infected rat (15 week pi) ; 2, serum from noninfected rat; 
3, MAb 14/29A5; 4, MAb 14/26B9; 5, MAb 14/8E1; 6, MAb 
14/13E10; 7, MAb 14/18H7; 8, MAb 24/36F1 (MAb directed 

25 against the BDV 23 kDa protein, negative control) ; 9, no 
antibody. (B) MAbs were analyzed for binding to native 
gpl8 in Western blot. Lanes: 1, serum from infected rat 
(15 week p.i., D2) ; 2, serum from noninfected rat; 3, MAb 
14/29A5; 4, MAb 14/26B9; 5, MAb 14/8E1; 6, MAb 14/13E10; 

30 7, MAb 14/18H7; and 8 # MAb 24/36F1 (MAb directed against 
the BDV 23 kDa protein, negative control) . 

FIG. 16 presents neutralization profile of sera and 
MAbs, (A) Serum from noninfected rat. (B) serum from 
infected rat (15 week p.i., D2) . (C) MAb 14/13E10. (D) 

35 MAb 14/29A5. 

FIG. 17 presents precipitation of BDV with sera from 
infected rats. (A) Lanes: 1, serum from infected rat, 15 
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week p.i.; 2, serum from infected rat, 5 week p.i.; 3, 
serum from infected rat, 15 week p.i.. no BDV; 4, serum 
from infected rat, 15 week p.i., genome sense primer used 
for first strand cDNA synthesis. (B) Precipitation of 
5 BDV by monospecific antisera to recpl8 and MAbs to gpl8. 
Lanes: 1, monospecific rat antisera to recplS; 2, MAb 
14/13E10; 3, MAb 14/29A5. DNA markers (basepairs) are 
shown at the right. 

FIG. 18 presents the cDNA of BDV polymerase. "V" 
10 denotes the site of its intron which is located between 
nucleotide positions 2410 and 3703 in the figure. "1-2", 
denotes that this is the second intron in the BDV genome. 

FIG. 19 presents the partial cDNA genomic sequence 
for BDV strain HE/80. 
15 FIG. 20 graphically presents in A) the 

immunoreaction of truncated recp23 protein fragments with 
sera from 7 human schizophrenics (SZ Human), 4 BDV 
infected horses (BD Horse) and 6 BDV infected rats (BD 
Rat); and in B) the truncated recp23 fragments. 
20 fig. 21 graphically presents in A) the 

immunoreaction of truncated unglycosylated recplB protein 
fragments with sera from 7 human schizophrenics (SZ 
Human) , 6 BDV infected rats (BD Rat) and 2 mice immunized 
with native gpl8 (Mouse a gplB) ; and in B) the truncated 
25 unglycosylated recplS fragments. 

FIG. 22 graphically presents the overlapping 8-mer 
peptides, derived from p23, lined up diagonally from the 
amino (left) terminus to the carboxyl (right) terminus. 
Above the overlapping peptides are blocks indicating the 
30 immunoreactive regions of P 23 and presenting the mapped 
epitopes and their sequences. 

FIG. 23 graphically presents the overlapping 8-mer 
peptides, derived from unglycosylated recplS, lined up 
diagonally from the amino (left) terminus to the carboxyl 
35 (right) terminus. Above the overlapping peptides are 
blocks indicating the immunoreactive regions of 
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unglycosylated gpl8 and presenting the mapped epitopes 
and their sequences. 

FIG. 24 graphically presents A) the SPOTS tests; B) 
the locations of immunoepitopes along the length of 
5 unglycosylated gplB which are immunoreactive with the 
sera in the SPOTS tests of FIG. 24A. The sequences of 
the most immunoreactive epitopes are shown. The scale 
indicates by the darkness of the spots, the degree of 
immunoreaction. The lightest shade (Scale 1) indicates 
10 no detectable immunoreactivity; the darkest shade (Scale 
4) indicates highest immunoreactivity. 

FIG. 25 presents the predicted amino acid sequence 
and potential N-glycosylation sites of the BDV G-protein. 



15 DETAILED DESCRIPTION OF THE INVENTION 

BDV Protein. Its *™ ir*n ft,n; ld and Nucleotide Sequences 

Table 1 identifies the sequence ID Nos. with their 
respective nucleotide and amino acid sequences. 

20 Table 1 

Nucleotide a nd Amino Acid Sequences of Borna Disease 

Virus (BDV) 

Sequence ID fro. 



25 Nucleoti de Sequence 



p40 
p23 
gp 18 
p57 

BDV polymerase 
BDV genomic cDNA 



30 



35 



40 



Amino Acid Sequence 

p40 
p23 
gp 18 
P 57 

BDV polymerase 



1 
3 
5 
7 
9 

.19 
Sequence ID No. 

2 
4 
6 
8 

10 



•pol' 



or 



45 



BDV polymerase is also referred to as 
"pl80". 

The present application discloses the complete BDV 
genomic nucleotide sequence, the locations on the genomic 
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nucleotide sequence which encode the different BDV 
proteins, the sites of splicing and overlap (see FIGs. 1 
and 2) . Also disclosed are the novel nucleotide and 
amino acid sequences of BDV proteins gplB, pol and p57. 
5 The following Figures 1, 2, 19, and Table 1 summarize 
this information. 

FIG. 1 shows the genomic organization and 
transcriptional map of BDV. The BDV genome is shown as 
a solid line in 3' to 5' direction. Coding regions and 
10 their respective reading frames are represented as boxes 
at the top; the number above each upward vertical line 
indicates the nucleotide position of the first AUG codon 
in the respective ORF. Transcription initiation sites 
and their nucleotide positions on the viral genome (BDV 
15 strain V) are represented by arrows pointing downstream. 
Transcription termination sites and splice sites are 
indicated by downward vertical lines. Dashed lines 
indicate that readthrough at termination sites T2 and T3 
results in synthesis of longer RNAs terminating at T3 and 
20 T4, respectively. The 1.2 kb and 0.8 kb RNA have been 
shown to represent the mRNAs for p40 and p23, 
respectively. p23 could also be translated from the 3.5 
kb RNA. Transcripts that are likely to represent mRNAs 
for gpl8, p57 and pol are indicated. Note that gpl8 can 
25 only be translated from RNAs containing intron 1. 
Splicing of intron 1 preserves the gpl8 initiation codon 
but introduces a stop codon such that only the first 13 
amino acids could be translated from the 2.7 (7.0) kb 
transcripts and the RNA or the 1.4 kb RNA serve as 
30 messages for the translation of BDV proteins. 

FIG. 2 shows the complete genomic sequence of BDV 
(strain V) in 5' to 3' cDNA. The deduced amino acid 
sequences are shown for p40, P 23, gplB. p57 and pol. 
Note: the full amino acid sequence for pol after 
35 splicing modification is shown in sequence ID No. 10. 
The stars (*) indicate stop codons. Information on 
transcription and splicing of the genomic sequence 



is 
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found in Schneider, P. A. et al. t J. Virol. , 68:5007-5012 (1994) 
and Schneemann, A. , et al, J. Virol , 68:6514-6522 (1994), both 
references are hereby incorporated by reference in their 
entirety. 

5 FIG. 19 presents the partial cDNA genomic sequence 

(also listed as SEQ ID No. 33) of BDV strain HE/80. 
Position 1 to 2651 of this sequence corresponds to 
position 1397 through 4054 of the cDNA genomic sequence 
of BDV strain V. The cDNA sequence of BDV strain HE/80 
10 disclosed herein encodes part of the p23 and BDV 
polymerase proteins, and the complete gpl8 and p57 
proteins . 

The term "nucleotide sequence" as used herein, 
unless otherwise modified, includes both ribonucleic acid 

15 (RNA) and deoxyribonucleic acid (DNA) . 

The sequences in Table 1 include both native and 
synthetic sequences. Unless otherwise modified, the term 
"protein" as used herein encompasses both native and 
synthetic polypeptide and peptide. Synthetic protein 

20 includes recombinant and chemically synthesized protein. 
Unless otherwise indicated, "gpl8", "p57", and "pol" 
proteins include both their native and synthetic 
versions. "recplS", M recp57" and "recpol" are 

recombinant proteins of "gpl8", "p57", and "pol" 

25 proteins, respectively. The terms "p57" and "recp57" 
herein include both the predicted protein of about 57 
kDa, and the glycoprotein of about 94 kDa (G-protein) , 
further described below. 

Some of the nucleotide sequences disclosed are in 

30 the form of DNA. For example, SEQ ID No. 19 presents the 
BDV viral genomic sequence as cDNA of BDV viral genomic 
RNA. One skilled in the art would realize that the BDV 
viral genomic RNA is complementary to its cDNA that is 
shown in Figure 2. The term "BDV genomic nucleotide 

35 sequence" thus includes both the full cDNA and RNA 
sequences of the BDV genome. Further, as used in this 
application and claims r the SEQ ID Nos. and disclosed 
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sequences include: (1) the DNA sequences as disclosed, 
(2) the complementary nucleotide sequences (which may be 
RNA or DNA) to the disclosed sequences, (3) the 
corresponding RNA sequences to the listed DNA sequences 
5 wherein the Thymidine ("T") in the disclosed DNA 
sequences is replaced with Uracil ("U"), (4) nucleotide 
sequences wherein other nucleotides known in the art such 
as nucleotide analogs, replace those in the foregoing 
sequences, for example, 5-methyl-cytosine replacing 
10 cytosine, and (5) nucleotide sequences that are within a 
variance (with regard to the respective SEQ ID Nos. or 
disclosed nucleotide sequences) of at least about: 10%, 
preferably 28%, more preferably 30%, and most preferably 
35%. For example, Kishi, M. , et at. , "Sequence Variability 
15 of Boma Disease Virus Open Reading Frame II Found in 
Human Peripheral Blood Mononuclear Cells", /. Virol. , 
70(1) :635-640 (Jan. 1996), cloned, sequenced, and 
analyzed cDNA of BDV ORF-II which encodes p24, from the 
peripheral blood mononuclear cells of three psychiatric 
20 patients. Fifteen clones were studied. Intrapatient 
divergences of the BDV ORF-II nucleotide sequence were 
4.2% to 7.3%, 4.8% to 7.3%, and 2.8% to 7.1% of the three 
patients, leading to differences of 7.7% to 14.5%, 10.3% 
to 17.1%, and 6.0% to 16.2%, respectively, in the deduced 
25 amino acid sequence for BDV p24. Interpatient 
divergencies among the 15 clones were 5.9% to 12.7% at 
the nucleotide level and 12.8% to 28.2% at the amino 
acids level. The nucleotide sequences of the 15 human 
BDV ORF-II cDNA clones differed from those of horse 
30 strains V and He/80-1 by 4 .2% to 9 . 3% . This reference is 
hereby incorporated by reference in its entirety. The 
above discussion would analogously apply to RNA sequences 
disclosed in this application. 

Since nucleotide codons are redundant, also within 
35 the scope of this invention are equivalent nucleotide 
sequences which include: nucleotide sequences which code 
for the same proteins or equivalent proteins. Thus, 
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nucleotide sequences which encode substantially the same 
or functionally equivalent amino acid sequence may be 
used in the practice of the invention. 

The terms "BDV genomic nucleotide sequence", w pl8 n , 
5 w recpl8 n , n pol", "recpol", "p57 w , "recp57", as used in 
relation to nucleotide sequences are defined above, 
together with: (1) nucleotide sequences that are within 
a variance (with regard to the respective nucleotide 
sequences in Table 1) of at least about: 10%, preferably 

10 28%, more preferably 30%, and most preferably 35% (see 
also the discussion of Kishi, M., et al. /. Virol, 70(1), 
above) ; (2) nucleotide sequences that are capable of 
hybridizing to the coding sequences of the respective 
nucleotide sequences, under stringent hybridization 

15 conditions, (3) nucleotide sequences coding for gpl8, 
recpl8, p57, recp57, pol, and recpol proteins, and amino 
acid sequences of SEQ ID Nos. 6, 8, and 10 respectively; 
and (4) fragments of SEQ ID Nos. 6; 8; 10; nucleotide 
number 1 through 53 and nucleotide number 1880 through 

20 8910 of SEQ ID NO 19 and their fragments; or other 
nucleotide sequences which, for example, encode proteins 
having substantially the same biological 
characteristics/activities of gpl8, recpl8, p57, recp57, 
pol, recpol proteins, respectively. Preferably, the 

25 determinative biological characteristic/activity is the 
retention of at least one immunoepitope. Preferably, 
when used in an immunoassay for BDV, these proteins are 
immunoreactive with antibodies directed to BDV but not 
detectably immunoreactive with non-BDV specific 

30 antibodies found in a biological sample such as a serum 
sample. Alternatively, the nucleotide sequences can be 
nucleotide probes of at least 10 nucleotides in length. 
Preferably, when used in a hybridization assay for BDV, 
these probes do not detectably hybridize to the 

35 nucleotide sequences of non-BDV organisms which are found 
in a biological sample such as a serum sample. 
Alternatively, the nucleotide sequences hybridize to at 
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least 10 consecutive nucleotides in the coding sequences 
of the above listed nucleotide sequences. The nucleotide 
sequences include a nucleotide sequence which encodes a 
protein containing at least 8; more preferably, 5 to 6; 
5 and most preferably, 4 amino acids. Preferably, the 
protein is specific to BDV or retain one or more 
biological functions of BDV. Examples of such biological 
functions are: BDV's ability to bind a particular 
cellular receptor, BDV's ability to target its host cells 

10 {e.g. cells and tissues of the nervous system, bone marrow, 
peripheral blood, mononuclear cells or brain), and BDV's 
effects on the functions of cells infected by it. The 
discussion herein similarly applies to p23 , recp23. P 80, 
recp80 nucleotide sequences, and the cDNA nucleotide 

15 sequence of FIG. 19, e.g. in reference to their respective 

SEQ ID NOs and FIGg. 19. 

The terms »gpl8", "recpl8", «p57«, «recp57'\ "pol", 
and "recpol", as used in relation to proteins are, 
respectively, as defined above together with: (1) 
protein variants containing amino acid sequences that are 
within a variance (with regard to the amino acid 
sequences of SEQ ID Nos. 6, 8, and 10, respectively) of 
at least about: 5%, preferably 28%, more preferably 30%, 
and most preferably 35% (see also the discussion of 
Kishi, M. , etcd. J.Virol., 70(1), above); (2) the functional 
equivalents of these proteins and their variants, 
respectively; and (3) the derivatives, including 
fragments, of gpl8, recplS, P 57, recp57, pol, recpol, 
proteins and their variants, respectively. Preferably, 
30 when used in an immunoassay for BDV, these proteins are 
immunoreactive with antibodies directed to BDV but not 
detectably immunoreactive with non-BDV specific 
antibodies found in a biological sample such as a serum 
sample. Alternatively, these proteins each contains at 
35 least 8; more preferably. 5 to 6; and most preferably, 4 
amino acids. Preferably, the latter proteins are 
specific to BDV or retain one or more biological 



20 



25 
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functions of BDV. Examples of such biological functions 
are: BDV s ability to bind a particular cellular 
receptor, BDV's ability to target its host cells (e.g. 
cells and tissues of the nervous system, bone marrow, 
5 peripheral blood, mononuclear cells or brain), and BDV's 
effects on the functions of cells infected by it. The 
discussion herein similarly applies to p23, recp23, p80, 
and recp80 proteins, e.g. in reference to their respective 
SEQ ID NOs. 

10 Within the definition of "BDV" are BDV isotypes, 

strains, and BDV related viruses. The term "BDV proteins 
and their derivatives", includes BDV proteins, fragments 
of BDV proteins, proteins containing immunoepitopes of 
BDV, variants and functional equivalents of the 

15 foregoing. gpl8 and p57 are examples of BDV proteins. 
Preferably, the immunoepitope is specific to BDV. 

The variants can result from, e.g. substitution, 
insertion, or deletion of the amino acid sequences shown 
in Table 1. The derivatives of the proteins and their 

20 variants, include fragments of these proteins and their 
immunogenic epitopes. Preferably, each of the fragments 
contains at least one immunogenic epitope of BDV. More 
preferably, the fragment is capable of being bound by 
polyclonal antibodies directed to BDV. In the case of 

25 antibodies which recognize linear epitopes, they 
generally bind to epitopes defined by about 3 to 10 amino 
acids. Preferably, too, each variant retains at least 
one immunoepitope of BDV. Preferably the immunoepitope 
is specific to BDV. 

30 Two amino acid sequences are functionally equivalent 

if they have substantially the same biological 
activities. The proteins may be fused to other proteins, 
for example, signal sequence fusions may be employed in 
order to more expeditiously direct the secretion of the 

35 BDV protein. The heterologous signal replaces the native 
BDV signal, and when the resulting fusion is recognized, 
i.e. processed and cleaved by the host cell, the BDV 
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10 



protein is secreted. Signals are selected based on the 
intended host cell, and may include bacterial, yeast, 
insect, mammalian, and viral sequences. For example, the 
native BDV signal or the herpes gD glycoprotein signal is 
suitable for use in mammalian expression systems. 

Substitutional variants of the proteins disclosed 
herein are those in which at least one residue in the 
disclosed sequences has been removed and a different 
residue inserted in its place. Preferably, the amino 
acid change is conservative. For example, such 
substitutions generally are made in accordance with the 
following Table 2. 
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TApLB 2 



Original Residue 



Ala 
Arg 
Asn 
Asp 
Cys 
Gin 
Glu 
Gly 
His 
He 
Leu 
Lys 
Met 
Pbe 
Ser 
Thr 
Trp 
Tyr 
Val 



Exemplary Substitutions 

ser 



10 



15 



5 



lys 

gin; his 
glu 

ser; ala 
asn 
asp 
pro 

asn; gin 
leu; val 
ile; val 
arg; gin; glu 
leu; ile 
met; leu; tyr 
thr 



ser 



20 



tyr 

trp; phe 
ile ; leu 



Novel amino acid sequences, as well as isosteric 
analogs (amino acid or otherwise) , are included within 

25 the scope of this invention. 

A variant typically is made by site specific 
mutagenesis of the encoding nucleic acid, expression of 
the variant nucleic acid in recombinant cell culture and, 
optionally, purification from the cell culture for 

30 example by immunoaf f inity adsorption on a column to which 
are bound polyclonal antibodies directed against the 
original protein from which the variant is derived. 

Another class of variants are deletional variants. 
Deletions are characterized by the removal of one or more 

35 amino acid residues from the original protein sequence. 
Typically, deletions are used to affect the original 
protein's biological activities. However, deletions 
which preserve the biological activity or immune cross- 
reactivity of the original protein are preferred. 

40 Deletions of cysteine or other labile residues also 

may be desirable, for example in increasing the oxidative 
stability of the original protein. Deletion or 
substitutions of potential proteolysis sites, e.g. Arg Arg, 
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is accomplished by deleting one of the basic residues or 
substituting one by glutaminyl or histidyl residues. 

It will be understood that some variants may exhibit 
reduced or no biological activity. These variants 
5 nonetheless are useful as standards in immunoassays for 
BDV protein so long as they retain at least one 
immunogenic epitope of BDV protein. 

It is presently believed that the three-dimensional 
structure of the proteins of the present invention is 
10 important to their functioning as described herein. 
Therefore, all related structural analogs which mimic the 
active structure of those formed by the compositions or 
proteins claimed herein are specifically included within 
the scope of the present invention. 
15 Modified proteins are also within the contemplation 

of this patent application. These modifications may be 
deliberate, e.g., modifications obtained through site- 
directed mutagenesis, or may be accidental, e.g. , as those 
obtained through mutation of the hosts. 
20 Further, as is the case for all proteins, the 

precise chemical structure depends on a number of 
factors. As ionizable amino and carboxyl groups are 
present in the molecule, a particular protein may be 
obtained as an acidic or basic salt, or in neutral form. 
25 All such preparations which retain their activity when 
placed in suitable environmental conditions are included 
in the definition. Additionally, the primary amino acid 
sequence may be augmented by derivatization using sugar 
moieties (glycosylation) or by other supplementary 
30 molecules such as lipids, phosphate, acetyl groups and 
the like, more commonly by conjugation with saccharides. 
The primary amino acid structure may also aggregate to 
form complexes, most frequently dimers. Certain aspects 
of such augmentation are accomplished through post- 
35 translational processing systems of the producing host; 
other such modifications may be introduced in vitro. In any 
event, such modifications are included in the definition 
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so long as the activity of the protein is not destroyed. 
It is expected that such modifications may quantitatively 
or qualitatively affect the activity, either by enhancing 
or diminishing the activity of the protein in various 
5 assays . 

Individual amino acid residues in the chain may also 
be modified by oxidation, reduction, or other 
derivatization, and the protein may be cleaved to obtain 
fragments which retain activity. Such alterations which 
10 do not destroy activity do not remove the protein 
sequence from the definition. The following discusses 
some of the modifications in further detail by way of 
example . 

Thus, glycosylation variants are included within the 

15 scope of BDV. They include variants completely lacking 
in glycosylation (unglycosylated) and variants having at 
least one less glycosylated site than the native form 
(deglycosylated) as well as variants in which the 
glycosylation has been changed. Included are 

20 deglycosylated and unglycosylated amino acid sequence 
variants, deglycosylated and unglycosylated BDV and gpl8 
having the native, unmodified amino acid sequence of BDV 
and gpl8, and other glycosylation variants, e.g. of p57. 
For example, substitutional or deletional mutagenesis is 

25 employed to eliminate the N- or O-linked glycosylation 
sites of BDV or gpl8, e.g., an asparagine residue is 
deleted or substituted for by another basic residue such 
as lysine or histidine. Alternatively, flanking residues 
making up the glycosylation site are substituted or 

30 deleted, even though the asparagine residues remain 
unchanged, in order to prevent glycosylation by 
eliminating the glycosylation recognition site. 

Unglycosylated protein which has the amino acid 
sequence of the native protein is preferably produced in 

35 recombinant prokaryotic cell culture because prokaryotes 
are incapable of introducing glycosylation into 
polypeptides . 
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Glycosylation variants are produced by selecting 
appropriate host cells or by in vitro methods. Yeast, for 
example, introduce glycosylation which varies 
significantly from that of mammalian systems . Similarly, 
5 mammalian cells having a different species (e.g. hamster, 
murine, insect, porcine, bovine or ovine) or tissue 
origin {e.g. lung, liver, lymphoid, mesenchymal or 
epidermal) than the source of the BDV antigen are 
routinely screened for the ability to introduce variant 
10 glycosylation as characterized for example by elevated 
levels of mannose or variant ratios of mannose, fucose, 
sialic acid, and other sugars typically found in 
mammalian glycoproteins. In vitro processing of the 
proteins of the present invention typically is 
15 accomplished by enzymatic hydrolysis, e.g. endoglycosidase 
digestion . 

Derivatization with Afunctional agents is useful 
for preparing intermolecular aggregates of BDV proteins 
and their derivatives with polypeptides as well as for 
20 cross-linking the protein and derivatives to a water 
insoluble support matrix or surface for use in the assay 
or affinity purification of its ligands. In addition, a 
study of intrachain cross- links will provide direct 
information on conformational structure. Commonly used 
25 cross-linking agents include sulfhydryl reagents, Li- 
bia (diazoacetyl) -2-phenylethane, glutar aldehyde, N- 
hydroxysuccinimide esters, for example esters with 4- 
azidosalicylic acid, homobi functional imidoesters 
including disuccinimidyl esters such as 3,3' -dithiobis 
30 (succinimidyl-propionate) , and Afunctional maleimides 
such as bis -N-tnaleimido- 1,8 -octane. 

Certain post-translational derivatizations are the 
result of the action of recombinant host cells on the 
expressed polypeptide. Glutaminyl and asparaginyl 
35 residues are frequently post-translationally deamidated 
to the corresponding glutamyl and aspartyl residues. 
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Alternatively, these residues are deamidated under mildly 
acidic conditions. Either form of these residues falls 
within the scope of this invention. 

Other post-translational modifications include 
5 hydroxylation of proline and lysine, phosphorylation of 
hydroxyl groups of seryl or threonyl residues, 
methylation of the a-amino groups of lysine, arginine, 
and histidine side chains {T.E. Creighton, Proteins: Structure 
and Molecular Properties , W.H. Freeman & Co., San Francisco, pp 

10 79-86 (1983)}, acetylation of the N-terminal amine and, 
in some instances, amidation of the C- terminal carboxyl. 

The claimed proteins are preferably produced using 
recombinant technologies. The nucleotide, e.g. , DNA or 
RNA, sequences which encode the desired polypeptides are 

15 amplified by use of e.g. the polymerase chain reaction in 
the case of DNA (hereinalso referred to as "PCR"), and 
reverse transcriptase -polymerase chain reaction (RT-PCR) 
in the case of RNA. Oligonucleotide sequences to be used 
as primers which can specifically bind to the ends of the 

20 regions of interest are synthesized. After the desired 
region of the gene has been amplified the desired 
sequence is incorporated into an expression vector which 
is transformed into a host cell. The nucleotide sequence 
is then expressed by the host cell to give the desired 

25 polypeptide which is harvested from the host cell. 
Plant, bacterial, yeast, insect, viral and mammalian 
expression systems may be used. Vectors which may be 
used in these expression systems may contain fragments of 
plant, bacterial, yeast, insect, viral, and/or mammalian 

30 origins. 

Given the teachings contained herein, , one skilled in 
the art can create the sequences disclosed herein, either 
by hand or with an automated apparatus. As examples of 
the current state of the art relating to polynucleotide 
35 synthesis, one is directed to Maniatis et al„ Molecular Cloning— 
A Laboratory Manual, Cold Spring Harbor Laboratory (1984), 
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and Horvath etcd.. An Automated DNA Synthesizer Employing Deoxynucleoside 
y-Phosphoramidites , Methods in Enzymology 154 : 313-326, 1987. 

r^i f iraHon O f Mnrleotitte Sequences, Clonjn , q . and 
5 f iypreasion of the Disclosed Protein 

Alternatively, to obtain RNA encoding the proteins 
disclosed herein, one needs only to conduct hybridization 
screening with labelled BDV nucleotide sequence (usually, 
greater than about 20, and ordinarily about 50bp) in 
10 order to detect clones which contain homologous sequences 
in the cDNA libraries derived from cells or tissues of a 
particular animal, followed by analyzing the clones by 
restriction enzyme analysis and nucleic acid sequencing 
to identify full-length clones. The cell lines, cells 
15 and tissues are preferably from the nervous system, bone 
marrow, peripheral blood, mononuclear cells or brain of 
BDV infected animals. Examples of cells from the nervous 
system are: neurons, oligodendrocytes and astrocytes. 
The primers shown in Examples 1 to 4 and/or the methods 
20 shown therein may also be used. 

If full length clones are not present in the 
library, then appropriate fragments are recovered from 
the various clones and ligated at restriction sites 
common to the fragments to assemble a full-length clone. 
25 The techniques shown in this section are also useful 

for identifying and sequencing various isotypes and 
strains of BDV and BDV related viruses. The present 
invention discloses the nucleotide sequences of two 
strains of BDV; different strains of BDV may exist or 
30 arise due to mutation as in the case of human 
immunodeficiency virus (HIV) which constantly mutates and 
of which different strains are constantly being 
discovered. Thus, within the definition of BDV are other 
BDV isotypes and strains or viruses related to 
35 BDV ("BDV related viruses"). For example, the next 
section of the application describes diagnostic assays 
for BDV or related pathogenesis. The related. 
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pathogenesis include: (1) diseases caused by BDV; (2) 
opportunistic or attendant diseases arising from BDV 
infection; and (3) diseases caused by BDV related 
viruses. The BDV related viruses would be nonsegmented, 
5 negative -stranded, neurotropic, post transcriptionally 
modified (spliced) viruses which share some homology with 
BDV nucleotide or amino acid sequences. Patients 
infected by the BDV related viruses would manifest 
clinical symptoms similar to BDV infected patients, or to 

10 that of neurologic or neuropsychiatric diseases. 

Thus, DNA or RNA encoding various BDV isotypes and 
strains, and BDV related viruses, can be similarly 
obtained by probing libraries from cells and tissues, 
especially cells of the nervous system, of animals 

15 exhibiting clinical symptoms of BDV infection, neurologic 
or neuropsychiatric disease; or animals that have been 
purposely infected with BDV strains, isotypes or BDV 
related viruses, such as shown in Example 2. Once the 
DNA or RNA sequence of these strains, isotypes, or 

20 related viruses are known, primers based on the sequence 
may be used. The methods shown in Examples 1 and 2, and 
the primers shown therein may also be used to obtain the 
genomic nucleotide sequences. 

In general, prokaryotes are used for cloning of DNA 

25 sequences in constructing the vectors useful in the 
invention. For example, E. coli K12 strain 294 (ATCC No. 
31446) is particularly useful. Other microbial strains 
which may be used include E. coli B and E. coli X1776 (ATCC 
No. 31537) . These examples are illustrative rather than 

30 limiting. Alternatively, in vitro methods of cloning, e.g. 
polymerase chain reaction, are suitable. 

The proteins of this invention may be expressed 
directly in recombinant cell culture as an N-terminal 
methionyl analogue, or as a fusion with a polypeptide 

35 heterologous to the hybrid/portion, preferably a signal 
sequence or other polypeptide having a specific cleavage 
site at the N- terminus of the hybrid/portion. For 
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example, in constructing a prokaryotic secretory 
expression vector for port ion/ fragment of BDV protein, 
the native BDV signal is employed with hosts that 
recognize that signal. When the secretory leader is 
5 -recognized" by the host, the host signal peptidase is 
capable of cleaving a fusion of the leader polypeptide 
fused at its C-terminus to the desired mature BDV 
protein. For host prokaryotes that do not process the 
BDV signal, the signal is substituted by a prokaryotic 
10 signal selected, for example, from the group of the 
alkaline phosphatase, penicillinase, lpp or heat stable 
• enterotoxin II leaders. For yeast secretion the BDV 
signal may be substituted by the yeast invertase, alpha 
factor or acid phosphatase leaders. In mammalian cell 
15 expression, the native signal is satisfactory for 
mammalian BDV, although other mammalian secretory protein 
signals are suitable, as are viral secretory leaders, for 
example the herpes simplex gD signal. 

The proteins of the present invention may be 
20 expressed in any host cell, but preferably are 
synthesized in mammalian hosts. However, host cells from 
prokaryotes, fungi, yeast, insects and the like are also 
are used for expression. Exemplary prokaryotes are the 
strains suitable for cloning as well as E. coli W3110 (F-A- 
25 A-prototrophic. ATTC No. 27325) , other enterobacteriaceae 
such as Serratiamarcescans, bacilli and various pseudomonads . 

Expression hosts typically are transformed with DNA 
encoding the proteins of the present invention which has 

30 been ligated into an expression vector. Such vectors 
ordinarily carry a replication origin (although this is 
not necessary where chromosomal integration will occur) . 
Expression vectors also include marker sequences which 
are capable of providing phenotypic selection in 

35 transformed cells, as will be discussed further below. 
For example, E. coli is typically transformed using pBR322, 
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a plasmid derived from an £ coli species {Bolivar, etal., 
Gene 2:95 (1977)}. pBR322 contains genes for ampicillin 
and tetracycline resistance and thus provides easy means 
for identifying transformed cells, whether for purposes 
5 of cloning or expression. Expression vectors also 
optimally will contain sequences which are useful for the 
control of transcription and translation, e.g. , promoters 
and Shine -Dalgarno sequences (for prokaryotes) or 
promoters and enhancers (for mammalian cells) . The 

10 promoters may be, but need not be, inducible; even 
powerful constitutive promoters such as the CMV promoter 
for mammalian hosts may produce BDV proteins without host 
cell toxicity. While it is conceivable that expression 
vectors need not contain any expression control, 

15 replicative sequences or selection genes, their absence 
may hamper the identification of transformants and the 
achievement of high level peptide expression. 

Promoters suitable for use with prokaryotic hosts 
illustratively include the /N lactamase and lactose 

20 promoter systems {Chang etal., Nature 275:615 (1978); and 
Goeddel etal., Nature 281:544 (1979)}, alkaline phosphatase, 
the tryptophan (trp) promoter system (Goeddel, Nucleic Acids 
Res. 8:4057 (1980) and EPO Appln. Publ. No. 36,776) and 
hybrid promoters such as the tac promoter {H. de Boer et 

25 al., Proc. Natl. Acad Sci. USA 80:21-25 (1983)}. However, other 
functional bacterial promoters are suitable* Their 
nucleotide sequences are generally known, thereby 
enabling a skilled worker operably to ligate them to DNA 
encoding the proteins of the present invention 

30 {Siebenlist etal., Cell 20:269 (1980)} using linkers or 
adaptors to supply any required restriction sites. 
Promoters for use in bacterial systems also will contain 
a Shine -Dalgarno (S.D.) sequence operably linked to the 
DNA encoding the proteins of the present invention 

35 m addition to prokaryotes, eukaryotic microbes such 

as yeast or filamentous fungi are satisfactory. 
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10 



15 



Saccharomyces cerevisiae is the most commonly used eukaryotic 
microorganism, although a number of other strains are 
commonly available. The plasmid YRp7 is a satisfactory 
expression vector in yeast {Stinchcomb, etal. Nature 282:39 
(1979); Kingsman etal, Gene 7:141 (1979); Tschemper etal., 
Gene 10:157 (1980)}. This plasmid already contains the 
trpl gene which provides a selection marker for a mutant 
strain of yeast lacking the ability to grow in 
tryptophan, for example ATCC no. 44076 or PEP4-1 {Jones, 
Genetics 85:12 (1977)}. The presence of the trpl lesion as 
a characteristic of the yeast host cell genome then 
provides an effective environment for detecting 
transformation by growth in the absence of tryptophan. 
Alternatively, viral expression vectors such as 
retroviral vectors, baculoviral vectors and Semliki 
Forest viral vectors are used. The expression hosts of 
these vectors are known in the art. 

Suitable promoting sequences for use with yeast 
hosts include the promoters for 3-phosphoglycerate kinase 
{Hitzeman etal., J. Biol. Chem. 255:2073 (1980)} or other 
glycolytic enzymes {Hess et al., J. Adv. Enzyme Reg. 7:149 (1968) ; 
and Holland, Biochemistry 17:4900 (1978)}, such as enolase, 
glyceraldehyde- 3 -phosphate dehydrogenase, hexokinase, 
pyruvate decarboxylase, phosphofructokinase, glucose- 6- 
25 phosphate isoroerase, 3-phosphoglycerate mutase, pyruvate 
kinase, triosephosphate isomerase, phosphoglucos 
isomerase, and glucokinase. 

Other yeast promoters, which are inducible promoters 
having the additional advantage of transcription 
30 controlled by growth conditions., are the promoter regions 
for alcohol dehydrogenase 2, isocytochrome C, acid 
phosphatase, degradative enzymes associated with nitrogen 
metabolism, metal lothione in, glyceraldehyde -3 -phosphate 
dehydragenase, and enzymes responsible for maltose and 
35 galactose utilization. Suitable vectors and promoters 
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for use in yeast expression are further described in R. 
Hitzeman et al. , European Patent Publication No. 73,657a. 

Expression control sequences are known for 
eukaryotes. Virtually all eukaryotic genes have an AT- 
5 rich region located approximately 25 to 30 bases upstream 
from the site where transcription is initiated. Another 
sequence found 70 to 80 bases upstream from the start of 
transcription of many genes is a CXCAAT region where X 
may be any nucleotide. At the 3' end of most eukaryotic 

10 genes is an AATAAA sequence which may be the signal for 
addition of the poly A tail to the 3' end of the coding 
sequence. All of these sequences may be inserted into 
mammalian expression vectors. 

Suitable promoters for controlling transcription 

15 from vectors in mammalian host cells are readily obtained 
from various sources, for example, the genomes of viruses 
such as polyoma virus, SV40, adenovirus, MMV (steroid 
inducible) , retroviruses {e.g. the I/TR of BDV) , hepatitis-B 
virus and most preferably cytomegalovirus, or from 

20 heterologous mammalian promoters, e.g. the beta actin 
promoter. The early and late promoters of SV40 are 
conveniently obtained as an SV40 restriction fragment 
which also contains the SV40 viral origin of replication. 
{Fiers et ai t Nature 273:113 (1978)}. The immediate early 

25 promoter of the human cytomegalovirus is conventionally 
obtained as a Hindlll E restriction fragment. 
{Greenaway, P.J. et al., Gene 18:355-360 (1982)}. 

Transcription of a DNA encoding the proteins of the 
present invention by higher eukaryotes is increased by 

30 inserting an enhancer sequence into the vector. 
Enhancers are cis- acting elements of DNA, usually about 
from 10-300bp, that act on a promoter to increase its 
transcription. Enhancers are relatively orientation and 
position independent having been found 5' {Laimins et ai , 

35 Proc. Natl. Acad. Set, 78:993 (1981)} and 3' {Lusky, M.L., et al . 
Mol. Cell Bio. 3:1108 (1983)} to the transcription unit. 
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within an intron {Banerji, J.L. etal., Cell 33:729 (1983)} 
as well as within the coding sequence itself {Osborne, 
T.F., etal., MoL Cell Bio. 4:1293 (1984)}. Many enhancer 
sequences are now known from mammalian genes (globin, 
5 elastase, albumin, a-fetoprotein and insulin) . 
Typically, however, one will use an enhancer from a 
eukaryotic cell virus. Examples include the SV40 
enhancer on the late side of the replication origin (bp 
100-270), the cytomegalovirus early promoter enhancer, 
10 the polyoma enhancer on the late side of the replication 
origin, and adenovirus enhancers. 

Expression vectors used in eukaryotic host cells 
(yeast, fungi, insect, plant, animal, human or nucleated 
cells from other multicellular organisms) will also 
15 contain sequences necessary for the termination of 
transcription which may affect mRNA expression. These 
regions are transcribed as polyadenylated segments in the 
untranslated portion of the mRNA. The 3' untranslated 
regions also include transcription termination sites. 
20 Expression vectors may contain a selection gene, 

also termed a selectable marker. Examples of suitable 
selectable markers for mammalian cells are dihydrofolate 
reductase (DHFR) , thymidine kinase (TK) or neomycin. 
When such selectable markers are successfully transferred 
25 into a mammalian host cell, the transformed mammalian 
host cell is able to survive if placed under selective 
pressure. There are two widely used distinct categories 
of selective regimes. The first category is based on a 
cell's metabolism and the use of a mutant cell line which 
30 lacks the ability to grow independent of a supplemented 
media. Two examples are CHO DHFR- cells and mouse LTK 
cells. These cells lack the ability to grow without the 
addition of such nutrients as thymidine or hypoxanthine . 
Because these cells lack certain genes necessary for a 
35 complete nucleotide synthesis pathway, they cannot 
survive unless the missing nucleotides are provided in a 
supplemented media. An alternative to supplementing the 
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media is to introduce an intact DHFR or TK gene into 
calls lacking the respective genes, thus altering their 
growth requirements. Individual cells which were not 
transformed with the DHFR or TK gene will not be capable 
5 of survival in non- supplemented media. 

The second category of selective regimes is dominant 
selection which refers to a selection scheme used in any 
cell type and does not require the use of a mutant cell 
line. These schemes typically use a drug to arrest 

10 growth of a host cell. Those cells which are 
successfully transformed with a heterologous gene express 
a protein conferring drug resistance and thus survive the 
selection regimen. Examples of such dominant selection 
use the drugs neomycin {Southern et al. , J. Molec. AppL Genet. 

15 1:327 (1982)}, mycophenolic acid {Mulligan et al., Science 
209:1422 (1980)} or hygromycin {Sugden etai, Mol Cell Biol. 
5:410-413 (1985)}. The three examples given above employ 
bacterial genes under eukaryotic control to convey 
resistance to the appropriate drug G418 or neomycin 

20 (geneticin) , xgpt (mycophenolic acid) or hygromycin, 
respectively. 

"Amplification" refers to the increase or 
replication of an isolated region within a cell's 
chromosomal DNA. Amplification is achieved using a 

25 selection agent, e.g. methotrexate (MTX) which inactivates 
DHFR. Amplification or the making of successive copies 
of the DHFR gene results in greater amounts of DHFR being 
produced in the face of greater amounts of MTX. 
Amplif ication pressure is applied notwithstanding the 

30 presence of endogenous DHFR, by adding ever greater 
amounts of MTX to the media. Amplification of a desired 
gene can be achieved by cotransfecting a mammalian host 
cell with a plasmid having a DNA encoding a desired 
protein and the DHFR or amplification gene permitting 

35 cointegration. One ensures that the cell requires more 
DHFR, which requirement is met by replication of the 
selection gene, by selecting only for cells that can grow 
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in the presence of ever-greater MTX concentration 
long as the gene encoding a desired heterologous protein 
has cointegrated with the selection gene replication of 
this gene gives rise to replication of the gene encoding 
5 the desired protein. The result is that increased copies 
of the gene, i.e. an amplified gene, encoding the desired 
heterologous protein express more of the desired protein. 

Suitable eukaryotic host cells for expressing the 
proteins include monkey kidney CV1 line transformed by 
10 SV40 (COS-7, ATCC CRL 1651); human embryonic kidney line 
(293 or 293 cells subcloned for growth in suspension 
culture, {Graham, F.L. et al J. Gen Virol. 36:59 (1977)}; 
baby hamster kidney cells (BHK, ATCC CCL 10); Chinese 
hamster ovary-cells-DHFR {CHO, Urlaub and Chasin, Proc. Natl. 
Acad. Set. (USA) 77:4216. (1980)}; mouse Sertoli cells {TM4, 
Mather, J.P. , Biol. Reprod 23:243-251 (1980)}; monkey kidney 
cells (CV1 ATCC CCL 70) ; african green monkey kidney 
cells (VERO-76, ATCC CRL-1587) ; human cervical carcinoma 
cells (HELA, ATCC CCL 2) ; canine kidney cells (MDCK, ATCC 
CCL 34) ; buffalo rat liver cells (BRL 3A, ATCC CRL 1442) ; 
human lung cells (W138, ATCC CCL 75); human liver cells 
(Hep G2, HB 8065); mouse mammary tumor (MMT 060562, ATCC 
CCL51) ;' TRI cells {Mather, J.P. , et al.. Annals N.Y. Acad Sci. 
383 :44-68 (1982)}; and C 6 glial cell (ATCC CCL 107) . 

Construction of suitable vectors containing the 
desired coding and control sequences employ standard 
ligation techniques. Isolated plasmids or DNA fragments 
are cleaved, tailored, and religated in the form desired 
to form the plasmids required. 

For analysis to confirm correct sequences in 
plasmids constructed, the ligation mixtures are used to 
transform E. coli K12 strain 294 (ATCC 31446) and successful 
transformants selected by ampicillin or tetracycline 
resistance where appropriate. Plasmids from the 
transformants are prepared, analyzed by restriction 
and/or sequenced by the method of Messing etal, Nucleic Acids 
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Res. 9:309 (1981) or by the method of Sanger etal, Proc. Natl 
Acad Sci. t (USA), 74:5463 (1977) . 

Host cells are transformed with the expression 
vectors of this invention and cultured in conventional 
5 nutrient media modified as appropriate for inducing 
promoters , selecting transformants or amplifying the 
genes encoding the desired sequences. The culture 
conditions, such as temperature, pH and the like, are 
those previously used with the host cell selected for 
10 expression, and will be apparent to the ordinarily 
skilled artisan. 

The host cells referred to in this disclosure 
encompass cells in in vitro culture as well as cells which 
are within a host animal. 

15 

Diaqnpstic , Proqnos tig , and MQflitorjLgq Upes of BDV 

proteins and their derivatives 

Another aspect of the present invention presents 
assays for detecting ligands, e.g. , in the biological 

20 samples of a test organism, which bind BDV protein (s) or 
derivatives thereof. These assays are useful as 
diagnostic tests for: (1) infection by BDV or related 
pathogenesis; and (2) neurologic and neuropsychiatry 
disease not due to BDV infection. 

25 The preferred assays are immunoassays which detect 

antibodies to BDV proteins or its derivatives that are 
antigenic (herein referred to as "BDV antigen"). The 
test organism can be human or other animals, such as 
cats, fowls, ostriches, and horses. The biological 

30 samples may be biological fluids such as whole blood, 
serum, plasma, cerebral spinal fluid, or synovial fluid. 
Preferably, BDV proteins or its derivatives are used to 
detect the ligand by binding to it. Preferably, the 
ligand is an antibody directed to the polypeptides, and 

35 BDV antigens are used to detect the antibody. For 
example, the assay can be used to detect antibodies 
against BDV in biological fluids. 
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Alternatively, antibodies to BDV protein (s) or their 
derivatives can be used to screen for BDV proteins, e.g., 
in the biological samples of a test organism. Similarly, 
the alternative detection of antibodies or antigen 
5 applies to each of the assay formats described below. 

Thus, an example of the assay is an enzyme 
immunoassay. In an example of a direct assay, these 
polypeptides serve as antigens and are attached to a 
solid phase and then incubated with patient sera. Human 
10 serum or plasma is preferably diluted in a sample diluent 
before incubation. If antibodies to BDV are present in 
the sample they will form an antigen-antibody complex 
with the polypeptides and become affixed to the solid 
phase . 

15 After the antigen -antibody complex has formed, 

unbound materials and reagents are removed by washing the 
solid phase and the antigen -antibody complex is reacted 
with a solution containing labelled antibodies directed 
against the first type of antibody. For example, the 
20 labelled antibody can be horseradish peroxidase- labeled 
goat antibody. This peroxidase labelled antibody then 
binds to the antigen -antibody complex already affixed to 
the solid phase. In a final reaction the horseradish 
peroxidase is contacted with o-phenylenediamine and 
25 hydrogen peroxide which results in a yellow-orange color. 
The intensity of the color is proportional to the amount 
of antibody which initially binds to the polypeptide 
affixed to the solid phase. 

Another assay format provides for an antibody- 
30 capture assay in which ant i- immunoglobulin antibody on 
the solid phase captures the patient's antibody, which is 
then reacted with the BDV antigen. The application of 
this format is similar to the serological assay of Lyme 
disease taught in Berardi et al., J. Infect. Dis. 158:754-760 
35 (1988) . If antibody to BDV is present, it captures the 
BDV antigen, and the bound BDV antigen is detected by 
means of labelled polyclonal or monoclonal antibodies 
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directed against the BDV antigen. The antibody- capture 
assay is particularly useful for and can increase the 
sensitivity of detection of IgM and IgG to BDV antigens. 
In an example of this assay, the fluid sample is first 
5 contacted with a solid support containing a bound 
antibody capable of binding the mu-chain of IgM or the 
gamma-chain of IgG antibodies. Specific antibody is 
detected by reacting this with the BDV antigens followed 
by non- human antibody to the BDV antigens. The non- human 

10 antibody is generally labelled for detection. It is 
believed that this antibody -capture immunoassay format 
will have increased sensitivity, especially for IgM. 
Alternatively, one can forego the non-human antibody and 
instead label the BDV antigens for direct detection. 

15 Another assay format provides for an immunodot assay 

for identifying the presence of an antibody that is 
immunologically reactive with specific BDV antigens by 
contacting a sample with the BDV antigens bound to a 
solid support under conditions suitable for complexing 

20 the antibody with the BDV antigens and detecting the 
antibody-antigen complex by reacting the complex. 

Suitable methods and reagents for detecting an 
antibody- antigen complex in am assay of the present 
invention are commercially available or known ,in the 

25 relevant art. For example, the detector antibodies or 
polypeptides may be labelled with enzymatic, 
radioisotopic, fluorescent, luminescent, or 
chemi luminescent label. These labels may be used in 
hapten- labelled antihapten detection systems according to 

30 known procedures, for example, a biotin-labelled 
antibiotin system may be used to detect an antibody- 
antigen complex. 

In all of the assays, the sample is preferably 
diluted before contacting the BDV antigen absorbed on a 

35 solid support. Solid support materials may include 
cellulose materials, such as paper and nitrocellulose; 
natural and synthetic polymeric materials, such as 
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polyacrylamide, polystyrene, and cotton; porous gels such 
as silica gel, agarose, dextran and gelatin; and 
inorganic materials such as deactivated alumina, 
magnesium sulfate and glass. Suitable solid support 
materials may be used in assays in a variety of well 
known physical configurations, including microtiter 
wells, test tubes, beads, strips, membranes, and 
microparticles. A preferred solid support for a non- 
immunodot assay is a polystyrene microwell, polystyrene 
beads, or polystyrene microparticles. A preferred solid 
support for an immunodot assay is nitrocellulose or nylon 
membrane . 

In particular, the invention presents an ELISA which 
is a rapid, sensitive, and inexpensive diagnostic test. 
15 The preferred ELISAs are based on recombinant BDV 
proteins recp40, recp23, and recpl8. These assays are 
more sensitive and rapid than prior art methods employed 
for serologic diagnosis of infection, such as Western 
blot, indirect immunof luorescent test or 
20 immunoprecipitation . 

Examples of the neurologic and neuropsychiatry 
diseases that can be diagnosed include diseases of the 
nervous system such as schizophrenia, depressive 
disorders {e.g., bipolar depression), multiple sclerosis 
25 and AIDS-related encephalopathy. The finding is based on 
applicants' analysis of the art. Although the virus has 
not been recovered from human subjects, antibodies 
reactive with BDV proteins have been found in patients 
with bipolar depression, schizophrenia, or AIDS-related 
30 encephalopathy {Bode, L., et al.. Arch Virol. Suppl. . 7:159- 
167 (1993); Bode, L. , et al., Lancet, ii:689 (1988) and 
Rott, R., et al., Science 228:755-756 (1985)). BDV has a 
unique tropism for specific brain regions. Viral nucleic 
acids and disease- specif ic proteins have been found in 
35 highest concentrations in the hippocampus and limbic 
circuits, prefrontal and cingulate cortices, and 
brainstem nuclei {Carbone, K. , et al . , J. Neuropathol. Exp. 
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Neurol, 50:205-214 (1991); Ludwig, H., et al., Prog. Med 
Virol 35:107-151 (1988) and Solbrig, M. V. , et al., abstr. 
10, Abstr. 1992 Am. Acad. Neurol. Annu. Meet., (1992)}. 
Three BDV proteins, p40, p23 and gpl8 (disclosed in 
5 Example 2 below) have been identified in infected cells 
and tissues {Ludwig, H., et al . , Prog. Med Virol 35:107-151 
(1988) and Thiedemann, N., et al., J. Gen Virol, 73:1057- 
1064 (1992)}. cDNAs for p40 {Lipkin, W. I., et al., Proc. 
Natl Acad Sci USA , 87:4184-4188 (1990); McClure, M. A., et 

10 al., J. Virol, 66:6572-6577 (1992) and Pyper, J. M. , et al., 
Virology, 195:229-238 (1993)} and p23 {Lipkin, W. I., et 
al., Proc. Natl Acad Sci USA, 87:4184-4188 (1990) ; Thierer, 
J., et al., J. Gen. Virol, 73:413-416 (1992) and VandeWoude, 
S., et al., Science, 250:1276-1281 (1990)} have been 

15 isolated, and complementary sequences to open reading 
frames (ORFs) for these proteins have been mapped to the 
viral genome {Briese, T., et al., Proc. Natl Acad Sci. USA 
91:4362-4366 (1994) which is incorporated into Example 1 
of this application; and Cubitt, B., et al., J. Virol, 

20 68:1382-1996 (1994)}. 

The assay can also be used to monitor the diseases 
by monitoring the titer of such ligands. The titer of 
the ligands, and the specific viral proteins that it is 
immunoreactive with, can also be prognosticative of the 

25 diseases. 

Thus, an application of this invention may involve 
contacting the test subject's biological sample, such as 
serum, with a panel consisting of different immunogenic 
fragments of BDV protein or its derivatives. These 

30 proteins may be synthetic or native proteins, though 
recombinant proteins are preferred. Such a panel may 
consist of, for example, recp23, recp40, recp57, recpol 
and recpl8. If the serum is immunoreactive with at least 
one of the fragments, it indicates that the test subject 

35 may either be suffering from (1) BDV or related 
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pathogenesis; or (2) neurologic and neuropsychiatric 
disease not due to BDV infection. Further, given a fixed 
amount of sample tested, the amount (i.e. percentage) of 
ligands immunoreactive with the BDV proteins may also be 
5 indicative of the severity of the disease and thus its 
prognosis. Generally, the higher the percentage of 
ligands that are immunoreactive, the more severe the 
disease and the poorer the prognosis. Thus, the assay 
may also be used to monitor the progress of the disease. 

10 in particular, if the test subject is undergoing 
treatment for the disease, the assay may be used to 
monitor the efficacy of the drug and treatment regimen. 
Such monitoring may involve assaying for the ligand titer 
and/or the specific BDV immunogenic epitopes which the 

15 ligand binds to. 

H ybridization Diag nostic Assays 

Oligonucleotides { "probes") that are unique, or 
relatively unique to BDV in a test sample, are useful for 

20 diagnosing BDV infections. Nucleotide hybridization 
assay may be used, whereby nucleic acids from a patient's 
biological sample are contacted to the primers or BDV 
restriction fragments under hybridization condition, and 
the hybridization products are detected. This method 

25 could be used to detect viral genomic RNA or mRNA. 
Conventional Western or Northern Blot analysis, RT-PCR or 
PCR and ligase chain reaction (LCR) may be used as the 
basis of the assay, these techniques are known to those 
skilled in the art. PCR and LCR techniques are widely 

30 available in the art. For example, the basic PCR 
techniques are described in United States Patent Nos. 
4,683,202; 4,683,195; 4,800,159; and 4,965,188. The 
basic LCR techniques are described in EPA-320,308; EPA- 
439,182; EPA-336,731; WO 89/09835; WO 89/12696, and WO 

35 90/01069. 

Since the present invention presents the full 
nucleotide sequence of the genomic BDV nucleotide 
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sequence, these probes can be identified by comparing 
this sequence with the sequences of other organisms which 
may contaminate a test sample. Such comparison can be 
conducted as described in Example 1 below or using 
5 methods known in the art. The probes preferably contain 
at least 10 contiguous nucleotides or at least 30 
contiguous nucleotides with at least 60% homology along 
the length of the BDV nucleotide sequence being compared. 
Examples of such probes and methods for conducting the 
10 PCR for detection are as described in Examples 1 and 2. 

Assay Kits 

The present invention also encompasses immunoassay 
kits containing BDV antigen (s) , preferably each antigen 

15 per container, in a concentration suitable for use in 
immunoassay. In the kits, the BDV antigens may be bound 
to a solid support and where needed, the kits may include 
sample preparation reagents, wash reagents, detection 
reagents and signal producing reagents. 

20 Also included are assay kits for nucleotide 

hybridization assays which include probes which are 
specific for BDV or its derivatives. The kits may also 
include sample preparation reagents, wash reagents, 
detection reagents and signal producing reagents. 

25 

Therapeutic Us es of Antibodies Directed to BDV proteins 
and Their Derivatives 

Another aspect of the invention presents methods, 
using antibodies directed to BDV proteins or derivatives, 

30 for treating: (1) BDV infection or related pathogenesis; 
and (2) neurologic and neuropsychiatric disease not due 
to BDV infection. Examples of such antibodies are those 
specific to gpl8 and p57. The antibodies may be 
administered using methods known in the art. Preferably, 

35 this involves passive administration of these antibodies, 
such as those described in Example 4. 
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T>„ r «Hri«a Paaful Por Diagnostic* and Therapeutics 

Another aspect of the invention presents peptides e.g. 
the truncated fragments and peptides disclosed in 
"EXAMPLE 5", below, containing at least one BDV 
5 iiranunoepitope. These peptides can be used in diagnostic 
assays to detect the presence of a patient's antibodies 
agaisnt BDV. Thus, the peptides are useful for the 
assays described in the section: "Diagnostic. Prognostic, 

*r>* Mrniito^nc Uses of ftPV proteins and tfceir 

10 derivatives" . For example, as shown in Example 3 below, 
recp40, recp23, and recplS have proved useful for 
detecting BDV infections. Thus, the epitopes of these 
recombinant proteins can be mapped, and smaller peptides 
containing these epitopes and routinely tested for their 
15 immunoreactivity with antibodies to BDV, e.g. using the EUSA 
method shown in Example 3. 

The above peptides can also be used to raise antibodies that may serve as 
therapeutics against BDV infections such as shown in Example 4 and as described 
in the section: -iwneutlc Uses of Antibodies Directed to, BDV 
20 r ™i-A-in S anc * TJiej n»rivatives" . Examples of methods for 
synthesizing peptide fragments are described in Stuart 
and Young in "Solid Phase Peptide Synthesis", 2nd ed., 
Pierce Chemical Co. (1984). It is contemplated that 
antibodies which precipitate BDV viral particles would be 
25 useful for therapeutic uses. In particular, these 
antibodies are raised against proteins, and their 
fragments, expressed on the surface of BDV. It is 
further contemplated that antibodies against gpl8, P 57 
and their fragments, especially antibodies that 
30 precipitate BDV viral proteins would be useful for 
treating or preventing the disease (1) BDV infection or 
related pathogenesis; and (2) neurologic and 
neuropsychiatry disease not due to BDV infection. 

Thus, fragments of BDV proteins, in particular gpl8 
35 and p57 and their fragments, can be made starting from 
either end of their C-termini and NH 2 -termini. For 



WO 96/21020 



PCI7US96/00418 



39 

example, these fragments can be tested according to the 
ELISA method shown in Example 3 against, e.g. sera from 
horses, rats, or human patients infected with BDV. The 
fragments that react with the sera would be useful for 
5 detecting the disease and would be useful for raising 
therapeutic antibodies to treat the disease. Since 
different animals may react to different epitopes of BDV 
proteins, one may even tailor the screening test by using 
the serum from the same species of animal for which one 

10 seeks to develop an assay or therapeutic. For example, 
if one is seeking a diagnostic test or therapeutic for 
humans, the sera tested will be preferably that from 
human patients. Included in this invention are other 
methods, known in the art, for identifying the 

15 immunoreactive epitopes of a protein and raising 
antibodies thereto. Further, since antibodies which are 
immunoreactive with BDV protein may also be found in the 
sera of patients with neurologic and neuropsychiatric 
disease not necessarily due to BDV infection, the above 

20 peptides and antibodies raised thereto may also find 
usefulness in diagnosing, monitoring and treating these 
patients. Additionally, these peptides may be identified 
by their immunoreactivity with sera from patients 
suffering from neurologic and neuropsychiatric disease 

25 not due to BDV infection. Thus, as described in this 
application, the disease, patient sera to be tested, the 
diagnostic, monitoring and therapeutic uses are not 
limited to BDV, and include (1) BDV infection or related 
pathogenesis; and (2) neurologic and neuropsychiatric 

30 disease not due to BDV infection. Further, one can 
screen for therapeutic ligands or chemicals which bind 
these peptides. These therapeutic chemicals then may be 
tested for their therapeutic effect against the above 
diseases. Other ligands or chemicals which bind the 

35 therapeutic ligands or chemicals can be tested for their 
ability to bind patients' antisera or antibodies and are 
thus useful as diagnostics for the diseases. 
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Preferably, the above peptides and antibodies are 
also respectively tested for their crossreactivity with 
antibodies raised by and proteins from organisms 
unrelated to the above diseases but commonly found in the 
5 test sample {e.g. patient's biological sample). Peptides 
and antibodies that are highly non-specific are 
preferably not used. To obtain peptides of high 
specificity, one may also compare the amino acid sequence 
of BDV protein with that of known contaminating proteins 
10 in the test sample. The fragments that are unique, or 
relatively so, to BDV are then chosen for further 
screening as described above, e.g. for immunoreactivity 
with patient's test sample. These comparison can also be 
done on the nucleotide sequence level. 

15 

Mo^nri for P r-oHn^ina Antibodies to BDV and its . 
Derivative 

Besides whole immunoglobulins, antibodies herein 
include antigen binding fragments of the immunoglobulins. 

20 Examples of these fragments are Fab, F(ab')2 and Fv. 
Such fragments can be produced by known methods. Unless 
otherwise indicated, antibodies herein also include: 
polyclonal and monoclonal antibodies, monospecific 
antibodies, and antisera which includes monospecific 

25 antisera. 

Antibodies to BDV proteins and their derivatives can 
be produced using standard procedures known in the art. 
For example, antibodies can be produced by inoculating a 
host animal such as a rabbit, rat, goat, mouse, etc., 

30 with BDV proteins and their derivatives. Before 
inoculation, the polypeptides or fragments may be first 
conjugated with keyhole limpet hemocyanin (KLH) or bovine 
serum albumin (BSA) . After an appropriate time period 
for the animal to produce antibodies to the polypeptides 

35 or fragments, the anti-serum of the animal is collected 
and the polyclonal antibodies separated from the anti- 
serum using techniques known in the art. Monoclonal 
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antibodies can be produced by the method described in 
Kohler and Milstein {Nature, 256:495-497, 1975) by 
immortalizing spleen cells from an animal inoculated with 
the polypeptides or fragments thereof. The 
5 immortalization of the spleen cell is usually conducted 
by fusing the cell with an immortal cell line, for 
example, a myeloma cell line, of the same or different 
species as the inoculated animal. The immortalized fused 
cell can then be cloned and the cell screened for 

10 production of the desired antibody. 

The antibodies may also be recombinant monoclonal 
antibodies produced according to the methods disclosed in 
Reading, United States Patent No. 4,474,893, or Cabilly 
et al., United States Patent No. 4,816,567. The 

15 antibodies may also be chemically constructed according 
to the method disclosed in Segel et al . , United States 
Patent No. 4,676,980. 

While the invention is demonstrated using mouse 
monoclonal antibodies and rat monospecific antisera, the 

20 invention is not so limited. In fact, human antibodies 
may be used and may prove to be preferable. The latter 
is especially so if the antibodies are used as 
therapeutics for humans, as there would be less 
immunorejection from the human patients receiving these 

25 antibodies. Such antibodies can be obtained by using 
human hybridomas {Cote et al., Monoclonal Antibodies and 
Cancer Therapy, Alan R. Liss, p. 77 (1985)}. In fact, 
according to the invention, techniques developed for the 
production of chimeric antibodies {Morrison et al., Proc. 

30 Natl. Acad. Sci., 81:6851 (1984); Neuberger et al., 
Nature, 312: 604 (1984); Takeda et al., Nature, 314: 452 
(1985)} by splicing the genes from a mouse antibody 
molecule of appropriate antigen specificity together with 
genes from a human antibody molecule of appropriate 

35 biological activity (such as ability to activate human 
complement and mediate antibody- dependent cell -mediated 
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cytotoxicity) can be used; such antibodies are within the 
scope of this invention. 

VACCINE 

5 By providing the nucleotide and amino acid sequences 

of the BDV genome and BDV proteins, respectively, this 
application enables the production of recombinant BDV {e.g. 
using the technique shown in Schnell, M. J., EMBO J., 13: 
4195-4203 (1994)} which can then be attenuated, e.g. by 
10 mutagenesis, heat or formaldehyde treatment, to be used 
as vaccine against (1) BDV infection or related 
pathogenesis; and (2) neurologic and neuropsychiatry 
disease not due to BDV infection. BDV sequences, their 
mutagenized sequences or fragments thereof, may be 
15 administered, e.g. by direct injection, or incorporated 
into a vector and administered e.g. by direct injection, 
into patients. Examples of the fragments are the 
truncated fragments and peptides disclosed in " EXAMPLE 
5", below. The injections may be by means of a gene gun. 
20 gpl8, p57, pol, and proteins produced by the mutagenized 
or fragmented sequences may also serve as vaccines. 
Proteinaceous vaccines may be delivered orally, 
intravenously, intraperitoneal^, or intramuscularly. 
The vaccine may also be contained in a physiologically 
25 compatible solution. 

r ™r v j rf ? vector Baaed Delivery Sysfrep 

Another aspect of the invention presents: (A) a 
BDV-roediated gene transfer for the incorporation and 
30 expression of eukaryotic or prokaryotic foreign genes 
into another eukaryotic or prokaryotic host; and (B) an 
in vitro BDV-mediated delivery of gene(s) or chemical (s) to 

a target cell. 

in Method A, one or more desired genes are inserted 
35 into the BDV viral vector. The desired gene transfer can 
be achieved through in vitro transf ection of a cell or cell 
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line by the resulting BDV viral vector. The transfected 
cell or cell line thus expresses the gene(s) of interest 
and the expression product (s) are harvested. 
Alternatively, the transfected cell or cell line is later 
5 transplanted into a host, e.g. an animal such as a human, 
in need of the gene product (s). In this case, the 
gene(s) is expressed in vivo. The generation of infectious 
non- segmented, neurotropic, negative -stranded RNA virus 
entirely from cloned cDNA, has been described in the case 

10 of rabies virus {Schnell, M. J., et al., EMBO J., 13(18): 
4195-4203 (1994)}. The insertion of foreign gene(s) into 
the BDV viral vector is based on prior art teachings for 
other viral vectors, which may include insertion of 
promoters or regulators to control expression of the 

15 foreign gene(s). The transfection and gene therapy is 
similarly based on. prior art teaching for viral vectors. 

Such teachings abound, see e.g. , U.S. Patent No. 
5,219,740 to Miller et al., Jun 15, 1993; U.S. Patent No. 
5,256,553 to Overell, Oct. 26, 1993; and WO 91/12329, 

20 assigned to Board of Regents, the University of Texas 
System, international publication date, Aug. 22, 1991. 

Method B utilizes the unique tropism of BDV for 
specific regions and cells of the nervous systems, e.g. 
neural cells. Thus, BDV vector cam be used for in vivo 

25 delivery of chemicals or desired genes to these regions. 
For example, infectious recombinant BDV containing the 
gene of interest can be used to infect the specific 
target cells of BDV in a host animal. The host can be a 
human suffering from deficiency, lack of, or a 

30 malfunctioning of the gene product. The general gene 
therapy methods can be based on prior art teaching e.g. 
the references cited for Method A, such as WO 91/12329. 

In the case of BDV viral vectors, these genes can be 
those responsible for the survival, proliferation, and 

35 proper functioning of the nervous system. For example, 
in neurodegenerative diseases, the cells in the patients' 
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nervous system suffer premature death, and these cells 
are not regenerated, eventually causing the patients to 
die. The inserted gene(s) may supplement or replace the 
dysfuntional genets) in these patients to provide gene 
5 product (s) necessary for continued survival and 
proliferation of these cells. Examples of the inserted 
genes include genes coding for: neurotransmitters, 
cytokines, growth factors, receptors for the foregoing, 
enzymes for activation of therapeutic drugs administered 
10 to the patients. 

Alternatively, the viral vector may contain a 
nucleotide sequence coding for a toxin. These vectors 
would infect the host's cells in vivo, express the toxin and 
kill the infected cells. The targeted cells are 
15 preferably neoplastic cells, or cells infected by or 
harboring pathogenic organisms. The vector is preferably 
further designed to selectively target these cells over 
normal cells. One means to target the desired cells is 
by localized injection of the recombinant virus, 
20 containing the desired gene, near the target of interest. 
However, for BDV based gene therapy, the vector or 
recombinant virus may be delivered peripherally, i.e. into 
subcutaneous tissue, peripheral nerve, or 
intramuscularly. The neurotropism of the recombinant 
25 virus allows it to migrate towards cells of the nervous 
system to transfect or infect them. 

The BDV viral vector is an especially good vehicle 
for gene therapy and in vivo chemical delivery. It has 
several advantages over the viral vectors known in the 
30 art, the most common of which are retroviral vectors. 
Retroviral vectors require replication of its host cells 
for transfection. Therefore, retroviral vectors can only 
be used with dividing/mitotic cells. In contrast, BDV 
vectors are autonomous, self -replicating vectors and thus 
35 can transfect both dividing and non-dividing cells. 
Thus, BDV is particularly effective for transfect ing 
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nerve cells that normally do not divide and for which BDV 
is tropic. 

Further, BDV does not have a latent stage in its 
lifecycle, after transfecting a host cell. It thus will 
5 continue to express the desired gene once it has 
transfected a cell. This is unlike some viral vectors 
currently used in the art, such as the herpes viral 
vector that may enter a latent stage after transfection 
and thus not express the desired gene product in the 
10 transfected cell. BDV is also unique in that it is a 
slow growing virus and is not lytic. Thus, chances of 
the virus lysing and killing the host cells are 
nonexistent . 

As a further safeguard, the BDV viral vectors may be 

15 made infective but replication-defective, rendering them 
useful vectors which are unable to produce infective 
virus, following introduction into a cell. For 
initiation of productive infection of BDV, a nucleocapsid 
containing BDV genomic RNA is required, from which 

20 primary transcription of mRNAs and ensuing autonomous and 
regulated expression of all BDV proteins occurs. Thus, 
to render the viral vector replication-defective, one may 
mutate the nucleocapsid protein produced by recombinant 
virus to prevent encapsidation of newly synthesized 

25 genomic RNA. Additionally, the host cell should 
preferably be devoid of infectious helper virus which may 
assist in replication of the BDV. Further, unlike 

retroviruses and herpes viruses, BDV does not cause 
disease in and of itself. The deleterious effect of BDV 

30 infection is actually caused by the host's immune- 
mediated rejection of BDV and BDV antigen expressed on 
infected cells. The rejection involves cellular immune 
response which activates the host's effector lymphocytes 
which then kill the transfected cells. Antibodies appear 

35 not to be as important in the host's immune response. 
Thus, one means to avoid Borna disease is to interfere 
with, avoid, or suppress the host's ability to recognize 
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or mount an immune response to BDV infected cells. For 
example, immune response in the host is triggered when T 
lymphocytes recognize a complex of major 
histocompatibility complex (MHC) and foreign antigen (in 
5 this case, BDV proteins) expressed on the host cell's 
surface. Thus, to reduce the host's immune response, one 
may choose to interfere with or prevent the expression of 
MHC on the transfected cells. This may be achieved by 
inserting, into the BDV viral vector, a nucleotide 

10 sequence which codes for a mRNA (i.e. an antisense mRKA) 
which would bind the mRNA coding for the component of MHC 
(•'mRNAHHc") and prevent the translation and expression of 
MHC in the transfected cell. Absent MHC, the BDV 
antigens will not be presented on the host cell surface 

15 to trigger immune -mediated rejection in the host. 
Alternatively, other methods known in the art may be used 
to avoid the immune rejection of BDV transfected cells. 
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agala g ™* samiencinq of Genomic RK* from Borna Disease 

,T4 fmwl Particles 

The studies in this example and Example 2, except 
with regard to p57, are also described in Briese, T., et 
al., Proc. Natl. Acad. Set, USA, 91:4362-4366 (1994) and Kliche, 
S., et al., J. Virol, 68: 6918-6923 (1994), respectively, 
both of which are hereby incorporated by reference in 
their entirety. In this example, the BDV genome was 
cloned to reveal antisense information for five open 
reading frames (ORPs) . From 5' to 3' on the antigenome, 
the ORFs are p40, p23, gpl8, p57 and pol. Proteins p40, 
p23 and gpl8 have been identified in infected cells and 
tissues: p40 and p23 are expressed at high levels in vitro 
and in vivo and are found in the nucleus and cytoplasm of 
infected cells {Bause-Niedrig, I., M. et al., Vet. Immunol. 
35 Immimopathol. , 31:361-369 (1992)}. gp 18 is a membrane- 



WO 96/21020 



PCT/US96/00418 



47 

associated glycoprotein that is expressed at lower 
levels. gp!8 was characterized in Example 2 below. 

Messenger RNAs {Kliche, S., et al., J. Virol, 68: 
6918-6923 (1994); Lipkin, W. I., et al., Proc. Natl Acad Sci. 
5 USA, 87:4184-4188 (1990); McClure, M. A. , et al. f J.Virol, 
66:6572-6577 (1992); Pyper, J. M. , et al., Virology, 
195:229-238 (1993); Thibault, K. J., M.S. thesis; 
University of California, Irvine (1992); Thierer, J., et 
al. f J. Gen. Virol, 73:413-416 (1992) and VandeWoude, S., et 

10 al., Science, 250:1278-1281 (1990)} and proteins {Bause- 
Niedrig, I., et al. r Vet Immunol Immunopathol , 31:361-369 
(1992); Haas, B., et al., J. Gen. Virol, 67:235-241 (1986); 
Ludwig, H., et al., Progr. Med Virol , 35:107-151 (1988); 
Schadler, R. , et al., J. Gen. Virol, 66:2479-2484 (1985) and 

15 Thiedemann, N. , et al., J. Gen. Virol, 73:1057-1064 (1992)} 
corresponding to three of these ORFs, p40, p23 and gpl8, 
have been found in infected cells and tissues in a 5' -3' 
expression gradient (p40 > p23 > gpl8) {Briese, T. # et 
al., Proc. Natl Acad Set USA : 91:4362-4366 (1994); Cubitt, B., 

20 et al., J.Virol, 68:1382-1396 (1994); and Richt, J. A., et 
al., J Gen. Virol, 72:2251-2255 (1991)}. 

Though Cubitt, B., et al., J. Virol, 68:1382-1396 
(1994) purported to have sequenced the BDV genome, their 
paper contains numerous errors. The errors included (1) 

25 failure to recognize deletions in subgenomic RNAs due to 
splicing; (2) misplacement of ORFs leading to the 
prediction of a 40kD protein instead of a 57kD protein 
and failure to detect ORF overlap of p57 with gp!8 and 
pol; and (3) selection of incorrect motifs for initiation 

30 of transcription. These mistakes were implicitly 
acknowledged in a subsequent paper, de la Torre, J. C, 
J. Virol, 68:7669-7675 (1994). Figure 1 of the latter paper 
incorporated the correct genomic organization and 
transcription map described in Example 1 of this 

35 application. A later minireview which compares the 
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10 



sequence differences between the above Cubitt, et al.'s 
genomic sequence and the sequence described in Example 1 
below concludes that the differences seem most likely due 
to cloning and/or sequencing errors (of Cubitt et al.'s) 
rather than natural differences between the nucleotide 
sequences of different strains. Schneeman, A., et al., 
to be published in Virology, 209 (1995) ; a co-author of the 
paper is Dr. Robert A. Lamb, the editor-in-chief of Virology 
and a Howard Hughes Medical Institute Investigator. Dr. 
Lamb was not a collaborator in the work described in 

Example 1 below. . 

in this Example, the 8,910 nucleotide BDV viral 
genome was cloned and sequenced using RNA from BDV 
particles. The viral genome has complementary 3' and 5' 
15 termini and contains antisense information for five open 
reading frames. Homology to Filo-, Paramyxo- and 
Rhabdoviridae is found in both cistronic and 
extracistronic regions . Northern analysis indicates that 
the virus transcribes mono- and polycistronic RNAs and 
uses termination/polyadenylation signals reminiscent of 
those observed in other negative -strand RNA viruses. BDV 
is likely to represent a previously unrecognized genus, 
bomaviruses, or family, Bornaviridae. within the order 
Mononegavirales . 
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MATERIALS AND METHODS 

BDV cDNA Library Preparation and Screening. 

Genomic RNA template for library construction was 
obtained from an oligodendrocyte cell line (Oligo/TL) 
acutely infected with BDV Strain V {Briese, T., et al., 
Proc. Natl. Acad Sci. USA 89:1148G-11489 (1992)}. For the first 
genomic library, RNA from one viral particle preparation 
was polyadenylated with poly (A) polymerase (GibcoBRL, 
Life Technologies, Inc., Grand Island, New York) to 
facilitate cloning from the 3' terminus by oligo d(T) 
primed cDNA synthesis. Libraries were prepared in pSPORT 
using the Superscript Plasmid system (GibcoBRL, Life 
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Technologies, Inc., Grand Island, New York). The first 
library was screened using pAB5 and pAF4 radiolabeled 
restriction fragments {Lipkin, W. I., et al., Proc. Natl 
Acad Set USA 87:4184-4188 (1990)}. Subsequent libraries 
5 were screened using radiolabeled restriction fragments 
from locations progressively 5' on the genomic RNA. 
5' -terminal sequence from each library was used to design 
an oligonucleotide primer for construction of the next 
library. DNA sequencing and sequence analysis. Plasmid 

10 DNA was sequenced on both strands by the 
dideoxynucleotide chain termination method {Sanger, F., 
et al., Proc. Natl Acad. ScL USA 74:5463-5467 (1977)} using 
bacteriophage T7 DNA polymerase (Sequenase version 2.0; 
United States Biochemical, Cleveland, Ohio) . Five to ten 

15 independent clones from each library were sequenced with 
overlap so that each region of the genomic RNA was 
covered by at least two clones. Four libraries were 
analyzed yielding -8.9 kb of continuous sequence. 
Nucleic acid sequence was analyzed using the Sequence 

20 Analysis Software Package (Genetics Computer, Inc., 
Madison, Wisconsin) . Database searches for related 
sequences and multiple sequence alignments were performed 
using FastA and Pileup. 

Sequence Determination at the 3' and 5' Termini of 

25 BDV Genomic RNA* Genomic RNA from one viral particle 
preparation (1-2 x 10 8 cells) was treated with tobacco 
acid pyrophosphatase (Epicentre Technologies, Madison, 
Wisconsin) and circularized with T4 RNA ligase (New 
England Biolabs, Inc., Beverly, Massachusetts) {Mandl, C. 

30 W. # et al., BioTechniques 10:484-486 (1991)}. The ligated 
RNA was reverse transcribed with Superscript II (Gibco 
BRL, Life Technologies, Inc., Grand Island, New York) 
using primer 5' -GCCTCCCCTTAGCGACACCCTGTA (SEQ ID NO: 
11) , complementary to a region 465 nucleotides (nt) from 

35 the 5' terminus of the BDV genome. A 2 /il aliquot of the 
reverse transcription reaction was used to amplify the 
ligated region by the polymerase chain reaction (PCR) 
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using Stoffel fragment (Perkin- Elmer Cetus, Norwalk, 
Connecticut) . Primers used in the first round of PCR 
were 5' -GCCTCCCCTTAGCGACACCCTGTA (SEQ ID NO: 11) and 
5 ' -GAAACATATCGCGCCGTGCA (SEQ ID NO: 12), located 241 nt 
5 from the 3' terminus of the BDV genome. Amplified 
products were subjected to a second round of PCR using a 
nested set of primers: 5 ' -TACGTTGGAGTTGTTAGGAAGC (SEQ ID 
NO: 13), 251 nt from the 5' terminus, and 

5' -GAGCTTAGGGAGGCTCGCTG (SEQ ID NO: 14), 120 nt from the 
10 3' terminus. PCR products were cloned {Schneider, P. A., 

et al., J. Virol. 68:63-68 (1994)} and sequence across the 

5'/3' junction was determined from five independent 

isolates. 

Northern hybridization. Poly (A)* enriched RNA 
15 extracted from acutely infected rat brain using FastTrack 
(invitrogen Corp., San Diego, California) was 
size- fractionated on 0.22 M formaldehyde/1. 0% agarose 
gels {Tsang, S. S., et al., BioTechniques 14:380-381 
(1993)}. transferred to Zeta-Probe GT nylon membranes 
20 (Bio-Rad Laboratories, Richmond, California) and 
hybridized with random-primed 32P-labeled restriction 
fragments {Feinberg, A. P., et al., Anal. Biochem. 132:6-13 
(1983)} representing ORFs across the BDV genome (FIG. 6 
b). RNA transfer, hybridization and washing were 
25 performed following the manufacturer's protocol (Bio-Rad 
Laboratories, Richmond, California) . 



RESULTS 

The following figures present some of the results: 
30 FIG 3. (a) Organization of the BDV genome. 

Hatched boxes represent coding sequence complementary to 
ORFs for identified proteins, p40, p23, gpl8, or putative 
proteins, P 57. pl80. (pl80 is also referred to as pol.) 
Overlap is indicated by cross-hatched areas. Length of 
coding sequence corresponding to ORFs in nucleotides is 
indicated in brackets. Underlined italic numbers 
indicate length of sequence from stop codon complement to 
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last templated uridine of termination/polyadenylylation 
signal (black boxes) . Italics with arrow indicate number 
of nucleotides in intervening sequence between p40 
polyadenylylation signal and p23 coding sequence and 
5 between p23 polyadenylylation signal and gpl8 coding 
sequence, respectively. Italics with dashed arrow 
indicate number of noncoding nucleotides at termini of 
the genome. (b) Coding potential of genome. Genomic 
sequence was translated in all six possible reading 

10 frames (3' -5' negative sense; 5' -3' positive sense) by 
using FRAMES (Genetics Computer Group) . ORFs are 
indicated by bars and hatched boxes. 

FIG. 4. Alignment of the pl80 (pol) ORF and 
negative -strand RNA virus L -polymerase amino acid 

15 sequences with PILEUP. Solid lines indicate conserved L- 
polymerase motifs (a, A, B, C, D) . BDV sequence is 
indicated with double arrowheads. Rhabdoviridae : RaV, 
rabies virus; VSV, vesicular stomatitis virus; SYN, 
sonchus yellow net virus. Paramyxoviridae : MeV, measles 

20 virus; SeV, Sendai virus; NDV, Newcastle disease virus; 
RSV, respiratory syncytial virus. Filoviridae: MaV, 
Marburg virus. Numbers indicate amino acid range shown. 
Uppercase letters in viral sequence lines indicate 
residues conserved in more than six sequences. Uppercase 

25 letters in consensus line (Con) indicate presence of 
identical or conserved amino acids in BDV. Agreement of 
BDV sequence with either rhabdo- or paramyxoviruses is 
indicated by * or x, respectively. + , Nonconserved 
glycine residue in BDV. 

30 FIG. 5. Sequence analysis of BDV genomic termini, 

(a) Similarity of 3' -terminal BDV sequence to leader 
regions of Rhabdoviridae (RaV) , VSV) , Paramyxoviridae 
(MeV, SeV, NDV, RSV) , and Filoviridae (MaV) . 
Abbreviations are as in FIG. 2. EboV, Ebola virus. 

35 Sequences are aligned by using arbitrary gap insertion to 
optimize nucleotide matching. (b) Comparison of 

complementarity at 3' and 5' termini of BDV genomic RNA 
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with that of four other nonsegmented, negative -strand RNA 
viruses. The 3' and 5' terminal sequences for each virus 
are shown in viral RNA (3' -5', negative sense) 
orientation. Underlined sequence refers to 

transcriptional start of first gene or end of the L- 
polymerase gene (also referred to as "pol gene"), 
respectively (predicted for BDV). The end of the L- 
polymerase gene of RaV is located outside the region 
shown. 

FIG. 6. Map of BDV subgenomic RNAs relative to the 
viral antigenome. (a) Northern hybridization analysis of 
rat brain poly (A) * RNA. Each lane was hybridized with a 
probe representing a major BDV ORF as indicated by the 
letters A-E (see b) . Results of hybridization with 
15 probes C* and E* were identical to results of 
hybridization with probes C and E, respectively (data not 
shown) . Numbers at left indicate size of RNA markers in 
kilobases. Numbers at right indicate estimated size of 
major transcripts. (b) Position of viral transcripts 
with respect to antigenome as determined by Northern 
hybridization and sequence analysis. Dashed lines 
indicate regions in the 1.5-kb RNA and the 6.1-kb RNA 
that contain a deletion. The boundaries of the deletions 
are not known. Relative positions of probes used for 
25 Northern hybridization are shown. On the ORF map, 
potential start codons are indicated with upward lines; 
0, start codons predicted to be functional; x, potential 
start codon present in strain V that is absent in strain 
He/80 (see text). Potential termination sites are 
30 indicated with downward lines. Use of T2 and T3 has been 
confirmed {McClure, M. etal.J. Virol, 66:6572-6577; Thierer, 
J. etal.,J.Gen. Virol.. 73:413-416}; use of T5 and T7 is 
consistent with hybridization results. Termination at 
tl t4 and t6 has not been observed (see a). (c) 
35 Alignment of the seven potential termination sites of 
BDV. Location of sites is indicated in the ORF map. 
Stop codons are underlined. Lowercase letters indicate 
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termination/polyadenylylation consensus sequence. No 
termination/polyadenylylation site was found at or near 
the end of the gpl8 ORF. 

5 Sequencing of Genomic BDV RNA. 

Beginning from the 3' terminus, a series of four 
overlapping cDNA libraries was constructed using BDV 
particle RNA {Briese, T., et al., Proc. Natl Acad Set USA 

89:11486-11489 (1992)} as template. Previous studies 

10 have shown that the genomic RNA is not polyadenylated {de 
la Torre, J., et al., Virology 179:853-856 (1990)}. Thus, 
to construct the first library, genomic RNA was 
polyadenylated in vitro in order to facilitate oligo 
d (T) -primed cDNA synthesis . For the subsequent three 

15 libraries, genome -complementary oligonucleotide primers 
were designed based on 5' terminal sequence determined in 
the previous round of cloning. Each region of the genome 
was sequenced using a minimum of two independent clones. 
To determine the sequences at the termini, genomic RNA 

20 was circularized and sequenced across the junction using 
five independent clones. 

The 8,910 nt BDV genome contained antisense 
information for five major ORFs flanked by 53 nt of 
noncoding sequence at the 3' terminus and 91 nt of 

25 noncoding sequence at the 5' terminus (FIG. 3) . In 3' -5' 
order, the first two ORFs encoded two previously 
described viral proteins, p40 {McClure, M. A., et al., J. 
Virol. 66:6572-6577 (1992)} and p23 {Thierer, J., et al . , 
1 Gen. Virol 73:413-416 (1992)}. The third, fourth and 

30 fifth ORF had coding capacities of 16 kDa (gpl8) , 57 kDa 
(p57) and 190kDa (pl80) , respectively (FIG. 3a). Note: 
plBO is now known as "pol" . Predicted amino acid 
sequence for the 16 kDa ORF correlated with microsequence 
data for an 18 kDa BDV glycoprotein (see the section 

35 below for gpl8 glycoprotein) , originally described as the 
Borna disease-associated 14.5 kDa protein {Schadler, R., 
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et al., J. Gen. Virol. 66:2479-2484 (1985)}. The first three 
ORFs showed no overlap and were in frame with the fifth 
ORF (FIG. 3b) . The 57 kDa ORF was in a +1/-2 frame 
relative to the other four ORFs and overlapped the 
adjacent ORF for gpl8 by 28 amino acids and ORF pl80 by 
34 amino acids. All ORFs were located on the (+) strand, 
complementary to the genomic RNA. ORF analysis of the 
genomic (-) strand showed only three small ORF's, each 
with a coding capacity of less than 16 kDa (FIG. 3b) . 



10 



Homology analysis of coding sequence. 

Predicted amino acid sequence for the identified 
ORFs was used to examine databases for similarity to 
other proteins. Previous analysis of the ORF encoding 
15 p40 had revealed distant sequence similarity to 
L-proteins of Paramyxoviridae and Rhabdoviridae {McClure, 
M. A., et al., /. Virol. 66:6572-6577 (1992)}. FastA 
analysis of translated sequence from ORFs p23, gpl8 and 
p57 showed no apparent similarity to other viral 
20 sequences; however, ORF pl80 sequence consistently 
retrieved L-polymerases of Paramyxo- and Rhabdoviridae. 
Alignment of ORF pl80 (pol) sequence with sequence of 
RNA-dependent RNA polymerases of negative -strand RNA 
viruses showed conservation of both sequence and linear 
25 order of regions homologous among these proteins. 
Extensive conservation was found in the four 
characteristic motifs for L-polymerases of 
negative-strand RNA viruses (A-D in FIG. 4) {Poch, O., et 
al., EMBOJ. 8:3867-3874 (1989) and Poch, O., et al., J. 
30 Gen. Virol. 71:1153-1162 (1990)}. With the exception of the 
glycine residue in motif B (position 322 of the 
alignment), conservation was found for the individual 
amino acid residues postulated to participate in 
polymerase function {Poch, 0., et al., EMBOJ. 8:3867-3874 
35 (1989)}. Conservation was also found for a motif (a in 
FIG. 4) proposed to participate in template recognition 
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{Poch, 0., et al., J. Gen. Virol. 71:1153-1162 (1990) and 
Barik, S., et al., Virology 175:332-337 (1990)}. The 
GCG/pileup alignment placed ORF pi 80 sequence between 
polymerases of Paramyxo- and Rhabdoviridae . This 
5 intermediate position is reflected by the presence of 
conserved amino acids which are in agreement with either 
the rhabdo- or the paramyxovirus sequences (* or x, 
respectively; FIG. 4) . The distance between conserved 
motifs a and A was found to be short in BDV as it is in 

10 rhabdoviruses , whereas this region is highly variable in 
length and sequence among paramyxoviruses {Poch, O., et 
al., J. Gen Virol 71:1153-1162 (1990)}. The GCG/pileup 
generated dendrogram, obtained using complete ORF pl80 
and L-protein sequences, indicated that the putative BDV 

15 polymerase was more closely related to L-polymerases of 
Rhabdoviridae than Paramyxoviridae . 

Analysis of Noncoding Sequence at the Genomic Termini - 

3' terminal genomic sequence had a high A/U content 

20 of 60.5 % with an A to U ratio of -1:2, similar to 3' 
leader sequences of other negative- strand RNA viruses. 
At the extreme 3' end, filo-, paramyxo- and rhabdoviruses 
have a common G/U rich region (FIG. 5a) . In BDV, as in 
respiratory syncitial virus, rabies virus and 

25 filoviruses, this region was not located at the 3' 
extremity. Comparison of the 3' and 5' termini of BDV 
genomic RNA revealed complementarity similar to that 
found in other negative -strand RNA viruses (Keene, J. D., 
et al., J. Virol 32:167-174 (1979) and Tordo, N., et al . , 

30 Virology 165:565-576 (1988)} (FIG. 5b). Alignment of the 
genomic termini allowed formation of a terminal 
panhandle, with the first three nucleotides unpaired. 
The subsequent complementary area of 6 nucleotides 
(positions 4-9 and 8907-8902) could be extended by one 

35 gap insertion between position 8901/8,902 resulting in an 
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additional 10 nt stretch of complementarity with a single 
mismatch (positions 18 and 8994; FIG. 5b). 

Identification of Potential Termination/ Polyadenylation 
5 Sites. 

Sequence preceding the poly (A) tracts of two cloned 
BDV mRNAs (UA,) {McClure, M. A., et al., J. Virol. 66:6572- 
6577 (1992) and Thierer, J . , et al., J. Gen. Virol. 73:413-416 
(1992)} was used to analyze genomic sequence for 

10 homologous sites that could serve as potential 
termination/ polyadenylation signals. Seven sites were 
found (FIG. 6 c) . Northern hybridization experiments 
supported use of four of these sites (T2, T3 f T5 and T7) 
and allowed identification of a 

15 termination/polyadenylation signal consensus sequence 
(CMNMYYMNWA,) , where M is A or C, Y is C or U, and W is A 
or D. Only one of the three remaining sites (t6) matched 
the consensus sequence (FIG. 6c) . 

20 Northern Hybridisation Analysis. 

Restriction fragments representing the five ORFs 
were used as probes for hybridization to poly (A)* enriched 
RNA isolated from acutely infected rat brain by FastTrack 
(FIG. 6a and b) . Because this procedure does not 

25 entirely eliminate poly (A)- RNAs, small levels of BDV 
genome-size RNA can usually be detected in these 
preparations. To allow determination of the relative 
abundance of RNAs detected by each probe, exposure times 
were normalized to the signal of the 8.9-kb RNA. 

30 Consistent with the 3' to 5' transcriptional gradient 
found for other negative -strand RNA viruses, of the eight 
subgenomic RNAs identified, those detected by the 3 '-most 
probes (genomic orientation) , A and B, were more abundant 
than those detected by the more 5' probes (FIG. 6a and 

35 b) . 

Mapping of the eight transcripts to the genome by 
Northern hybridization indicated use of only three sites 
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for transcriptional initiation and four sites for 
termination. Probes C* and E* were used to distinguish 
between termination at T5 or t6 (FIG. 6b) . The patterns 
of hybridization with probes C* and E* were identical to 
5 those obtained with probes C and E, respectively 
indicating termination at T5 (data not shown) . Probes 
corresponding to p40 (A) and p23 (B) detected 
monocistronic RNAs of 1.2 kb and 0.75 kb, respectively 
(FIG. 6). Probes A and B also detected a 1.9 kb RNA 

10 consistent with failure of transcriptional termination at 
the p40 termination site {Pyper, J. M. , et al., Virology 
195:229-238 (1993)}. Transcriptional readthrough was 
also found for polycistronic transcripts of 3.5, 2.8 kb 
and 7.1 kb. The 3.5 kb RNA detected by probes B, C, D 

15 and C*, is likely to initiate at or near the beginning of 
ORF p23 and terminate at T5. The 2.8 kb RNA detected by 
probes C, D and C* f is likely to initiate at or near the 
beginning of ORF gpl8 and terminate at T5. The 7.1 kb 
detected by probes C, D, C*, E* and E # is likely to 

20 initiate at or near the beginning of ORF gp!8 and to 
continue through T5 until it terminates at T7. Probes C 
and C* both hybridized to a 1.5 kb RNA and a 6.1 kb RNA. 
Interestingly, neither the 1 . 5 kb RNA nor the 6 . 1 kb RNAs 
was detected by probe D, located between C and C* on the 

25 viral genome. These findings are consistent with 
posttranscriptional modification resulting in a 1-1.3 kb 
deletion (FIG. 6) . 

DISCUSSION 

30 The order Mononegavirales, which incorporates the 

families Filoviridae, Paramyxoviridae and Rhabdoviridae, 
has distinct characteristics that include: (1) a 
nonsegmented negative sense RNA genome, (2) linear genome 
organization in the order 3' untranslated region /core 

35 protein genes /envelope protein genes /polymerase gene 
/untranslated 5' region, (3) a virion associated 
RNA-dependent RNA polymerase, (4) a helical nucleocapsid 
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that serves as template for replication and 
transcription, (5) transcription of 5-10 discrete, 
unprocessed mRNAs by sequential interrupted synthesis 
from a single promoter and (6) replication by synthesis 
5 of a positive sense antigenoroe {Pringle, C. R., et al., 
Arch. Virol. 117:137-140 (1991)}. The genomes of rhabdo-, 
paramyxo- and filoviruses range in size from 11 to 20 kb. 
The BDV genome has been estimated to be between 8.5 
{Lipkin, W. I., et al., Proc. Natl. Acad Set USA 87:4184-4188 
10 (1990) and de la Torre, J., et al., Virology 179:853-856 
(1990)} and 10.5 kb {VandeWoude, S., et al., Science 
250:1276-1281 (1990) and Richt, J., et al., J. Gen Virol. 
72:2251-2255 (1991)} in length. Our data confirm that 
the BDV genome, at only 8910 nt, is smaller than those of 
15 other negative-strand RNA viruses. Several features 
suggest that BDV is a member of the order 
Mononegavirales: organization of ORFs on the genome, 
extensive sequence similarities of the largest BDV ORF to 
L-polymerases of rhabdo-, paramyxo- and filoviruses, 
20 homology of 3' noncoding sequence to leader sequences of 
Mononegavirales and complementarity of BDV genomic 
termini . 

In 5' to 3' antigenomic orientation, the first ORF 
contains 1110 nt. Due to a more favorable translation 
25 initiation context {Kozak, M., Nucleic Acids Res. 15:8125-8148 
(1987)}, it is likely that the second AUG codon, 39 nt 
inside the ORF, is used to express a 357 aa protein of 
39.5 kDa (p40) {Pyper, J. M. , et al., Virology 195:229-238 
(1993) } • 26 nt downstream of the stop codon is a 

30 polyadenylation signal {McClure. M. A., et al., J- Virol. 
66:6572-6577 (1992)} (T2, FIG. 6 b and c) . The second 
ORF starts 79 nt from the p40 polyadenylation site. It 
has a length of 603 nt coding for a 201 aa protein of 
22.5 kDa (p23) . The stop codon of ORF p23 is part of the 

35 polyadenylation signal {Thierer, J., et al., J. Gen. Virol. 
73:413-416 (1992)} (T3, FIG. 6b and c) . Analysis of the 
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intergenic region between ORFs p40 and p23 has shown that 
this sequence is less conserved among different BDV 
isolates than coding sequences for p40 and p23 
{Schneider, P. A. , et al., J. Virol 68:63-68 (1994)}. 
5 Therefore, expression of a small ORF in this region (x, 
FIG. 3 b) ; {VandeWoude, S., et al., Science 250:1276-1281 
(1990) and Pyper, J. M. , et al., Virology 195:229-238 
(1993) } that overlaps with ORF p23 seems unlikely 
{Schneider, P. A., et al., J. Virol 68:63-68 (1994)}. Ten 

10 nt downstream of the p23 polyadenylation signal is the 
third ORF, 426 nt in length, that codes for a 142 aa 
(16.2 kDa) protein. Due to glycosylation, the protein 
expressed from this ORF has a Mr of ~18 kDa (gpl8) . 

No polyadenylation signal similar to those 

15 identified for p40 and p23 mRNAs {McClure, M. A., et al., 
J. Virol 66:6572-6577 (1992) and Thierer, J., et al., J. Gen. 
Virol 73:413-416 (1992)} was found near the end of the 
gpl8 ORF (FIG. 6b and c) . Instead, the following ORF 
overlaps with the end of the gpl8 ORF by 28 aa. It has 

20 a total size of 1,509 nt that could code for a 503 aa 
protein of 56.7 kDa (p57) . The ORF has two AUG codons in 
the overlap with gpl8. A third AUG located outside the 
overlap is 451 nt from the beginning of the ORF. Which, 
if any, of these AUGs is used is unknown as no protein 

25 has been identified. A potential polyadenylation site is 
located 28 nt downstream of the p57 ORF (t4) . However, 
Northern hybridization results suggest that this site is 
a weak or nonfunctional signal, because no major 
transcript (s) were found to stop at this position (FIG. 

30 6) . 

The fifth ORF encompasses more than half the length 
of the genome. A potential polyadenylation site (T7) , 
similar to that seen at the end of ORFs p40 and p23, is 
found 33 nt from the stop codon of pl80 (pol) ORF (FIG. 
35 6b and c) . Deletions identified by Northern 

hybridization analysis suggested that viral mRNAs might 
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undergo post -transcriptional modification by RNA 
splicing. This hypothesis was subsequently confirmed by 
applicants {Schneider. P.A. et al, J. Virol. , 68:5007-5012 
(1994); Schneemann, A. et al. J. Virol. , 68:6514-6522 (1994), 
5 hereby incorporated in their entirety.} RNA splicing 
extends the pol ORF by 459 nucleotides allowing 
prediction of a protein of 190kDa. {Schneider, P. etal.J. 
Virol, 68:5007-5012 (1994)}. Although functional studies 
of BDV proteins have not yet been done, the organization 
10 of the viral genome together with the limited biochemical 
data available suggest possible roles for individual 
proteins in the virus life cycle. Four lines of evidence 
suggest that p40 is likely to be a structural protein: 
(1) like nucleocapsid proteins (N) of rhabdo- and 
15 paramyxoviruses {Banerjee, A. K. , et al., Pharmacol, Ther. 
51:47-70 (1991)} (except pneumoviruses {Collins, P. L. , 
The Paramyxoviruses, ed. Kingsbury, D. W. (Plenum, New York) , 
pp. 103-162 (1991)}), P40 is found in the most 3' 
position on the genome; (2) p40 is similar in size to N 
20 proteins; (3) both p40 {Pyper, J. M. , et al., Virologyy 
195:229-238 (1993) and Ludwig, H. , et al., Prog. Med. Virol. 
35:107-151 (1988)} and N proteins {Banerjee, A. K. , et 
al., Pharmacol, Ther. 51:47-70 (1991)} are abundant in 
infected cells and particles; (4) neither N proteins 
25 {Banerjee, A. K. , et al., Pharmacol, Ther. 51:47-70 (1991)} 
nor P 40 {Thiedemann, H., et al., J. Gen. Virol. 73:1057-1064 
(1992)} are phosphorylated or glycosylated. p23, a 
phosphorylated protein {Thiedemann, H . , et al . , J. Gen. Virol. 
73:1057-1064 (1992)}, is in the next position on the 
30 genome. ORF p23 corresponds in position to genes coding 
for phosphoproteins in Paramyxoviridae (P) and 
Khabdoviridae (NS) {Banerjee, A. K., et al., Pharmacol. Ther. 
51:47-70 (1991)}. This suggests that P 23 might serve a 
similar role in the BDV system. In support of this 
35 hypothesis, GCG analysis showed that the protein has a 
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high Ser/Thr content (16%) , is charged (pi 4.8) and 
contains a N- terminal cluster of acidic amino acids 
compatible with structural features of P/NS proteins 
{Banerjee, A. K. , et al., Pharmacol Ther 51:47-70 (1991)}, 

. 5 In previously described Mononegavirales, the next gene 
codes for matrix protein (M) {Banerjee, A* K., et al., 
Pharmacol Ther 51:47-70 (1991)}. gpl8 occupies this 
position on the BDV genome. Though small for a matrix 
protein, gpl8 has a predicted pi ,10, that is close to 

10 the basic pi of M proteins, -9, and its 
membrane -association would be compatible with a matrix 
protein function. For p57, computer analysis predicted 
similarities to glycoproteins of negative -strand RNA 
viruses: potential glycosylation sites as well as 

15 N- terminal and C- terminal hydrophobic "anchor" domains 
(data not shown) . The largest ORF (pol) is located most 
5' on the genome. Its size, 5' position and conservation 
of motifs considered critical to L-polymerase activity, 
suggest that this ORF is likely to code for the BDV 

20 polymerase (FIG. 6) . 

Analysis of Northern hybridization experiments in 
conjunction with genomic sequence data has allowed 
construction of a tentative transcription map (FIG. 6) . 
While it has not been possible to identify signals for 

25 initiation of transcription by using consensus sequences 
of other negative -strand RNA viruses, we have identified 
consensus sequence for termination/polyadenylation in BDV 
using known ends of p40 and p23 mRNAs {McClure, M. A., et 
al., J. Virol 66:6572-6577 (1992) and Thierer, J., et al., 

30 J. Gen Virol 73:413-416 (1992)} (FIG. 6c). These sequences 
appear to function as weak termination signals. Unlike 
other negative -strand RNA viruses, BDV shows a high 
frequency of readthrough transcripts. Organization and 
sequence similarities to Filo-, Paramyxo- and 

35 Rhabdoviridae suggest that BDV is a member of the order 
Mononegavirales. Dependent on the parameters and regions 
selected for homology analysis, BDV can be represented as 
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or 



being more closely related to filo-, paramyxo 
rhabdoviruses. Overlap of coding sequence, high 
frequency of polycistronic readthrough transcripts and 
posttranscriptional modification are properties of the 
BDV system not found in other members of the order 
Mononegavirales. These features could serve as 
independent mechanisms for modulation of gene expression 
to achieve the persistent, non-cytopathic infection that 
is a cardinal characteristic of this neurotropic virus. 



10 
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Using methods for isolation of the 14.5-kDa protein 
{Schadler, R. . et al.. J. Gen. Virol., 66:2479-2484 (1985)}, 

15 we have purified a glycoprotein from BDV- infected rat 
brain that is encoded by a 429-nucleotide (nt) ORF 
located 3' to ORF p23 on the viral antigenome. The 
protein is predicted to be 16.2 kDa; glycosylation 
results in a 1- to 2-kDa increase in molecular weight. 

20 This glycoprotein, gpl8, is the first glycoprotein to be 
identified in the BDV system. Lectin binding and 
endoglycosidase sensitivity assays suggest that gpl8 is 
an unusual N- linked glycoprotein. 

25 MATERIALS AND METHODS 

Infection of animals and cultured cells. 

Animals and cells were infected with BDV strain 
He/80 {Herzog, S., et al., Med Microbiol. Immunol. , 168:153- 
158 (1980) and Schneider, P. et al.. Virol. 68:63-68 

30 (1994)}. Newborn Lewis rats were infected by 
intracranial injection with 1.5 x 10* focus -forming units 
of BDV. Three weeks after infection, animals were 
sacrificed and brains were removed for isolation of BDV 
particles {Carbone, K. , et al., J. Virol., 61:3431-3440 

35 (1987)} or gpl8. C6 cells and MDCK cells were 
persistently infected with BDV as described previously 
{Carbone, K. M. , J. Virol. . 67:1453-1460 (1993) and Herzog, 
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S., et al., Med Microbiol. Immunol, 168:153-158 (1980)}. 
Monolayers of rabbit fetal glial cells were acutely 
infected by adding BDV at 1.0 focus -forming unit per cell 
to the culture medium (Dulbecco modified Eagle medium, 5% 
5 fetal calf serum; Gibco BRL, Grand Island, New York) . 

Protein purification and microsequencing . 

Protein was purified from infected cells and tissues 
by detergent -salt extraction by the method of Schadler et 

10 al. {Schadler, R., et al., J. Gen Virol, 66:2479-2484 
(1985)}. For microsequencing, protein was cleaved with 
10% cyanogen bromide in 75% formic acid (Sigma Chemical 
Co., St. Louis, Missouri). Peptide fragments were 
separated by reverse-phase high-performance liquid 

15 chromatography (RP-HPLC) on a Vydac C-18 column, using a 
trif luoroacetic acidacetonitrile gradient . Sequence 
determinations were performed by automated Edman 
degradation on a Hewlett-Packard model G1000A protein 
sequencer . 

20 

Antibodies • 

Antibodies to purified gpl8 were produced in 3- 
month-old BALB/c mice. Animals were injected 

subcutaneously with 5 fig of protein in Freund's complete 

25 adjuvant and boosted 3 weeks later with a subcutaneous 
injection of 3 fig of protein in Freund's incomplete 
adjuvant. For 6 weeks thereafter, at 2-week intervals, 
animals received intraperitoneal injections of 5 fig of 
protein in phosphate-buffered saline (PBS) with 5 fig of 

30 lipopolysaccharide {Salmonella typhimurium; Difco, Detroit, 
Michigan) (three injections) . Blood was drawn every 2 
weeks during weeks 7 through 28 for measurement of serum 
antibody titer to purified protein by Western blotting 
(immunoblotting) . Antisera collected at week 28 were 

35 used for virus neutralization studies. Rabbit antisera 
to recombinant BDV p40 and p23 were used as controls (see 
Example 3, below) . 
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Cloning and sequencing of CDNA encoding gpl8. 

gpl8-specif ic oligonucleotides were used to amplify 
full-length coding sequence for gpl8 from two BDV- 
infected adult rat brain cDNA libraries {Lipkin, W. I., 
5 et al., Proc. Natl. Acad Sci. USA, 87:4184-4188 (1990) and 
McClure, M. A. , et al-, J.Virol. 66:6572-6577 (1992)} as 
well as total cellular RNA {Chirgwin, J. J., et al., 
Biochemistry, 18:5294-5299 (1979)} and poly(A)- RNA {Aviv, 
H., et al., Proc. Natl. Acad Sci. USA. 69:1408-1412 (1972)} 
10 extracted from infected rat brain. Reverse transcription 
(RT) was performed with an oligo(dT) primer and 
Superscript II (Gibco BRL, Life Technologies, Inc., Grand 
Island, New York ) . PCR was carried out with Ampli-Taq 
Stoffel fragment according to standard protocols (Perkin- 
15 Elmer, Norwalk, Connecticut) with the following primer 
pair: 5' -terminal JfltoI-gplB sense oligonucleotide (Xhol- 
gp 18 -Sl), TCOTCGAGA TGAATTCAAAACATTCCTATC (nt 1892 to 
1914; Xhol restriction site indicated by underlining) (SEQ 
ID NO: 15); and 3' -terminal gplB antisense 
20 oligonucleotide (gpl8-ASl) . CTAAGGCCCTGAAGATCGAAT (nt 
2301 to 2321) (SEQ ID NO: 16) . Products were purified by 
agarose gel electrophoresis using a USBioclean 
purification kit (U.S. Biochemical, Cleveland, Ohio) and 
cloned into Bluescript SKII+ (Stratagene, San Diego, 
25 California) prepared with 3' T overhangs {Marchuk, D., et 
al., Nucleic Acid Res., 19:1154 (1990)}. A minimum of three 
independent clones from each template source was 
sequenced on both strands by the dideoxynucleotide chain 
termination method using bacteriophage T7 DNA polymerase 
30 (Sequenase; U.S. Biochemical, Cleveland, Ohio). The 
plasmid resulting from amplification of neonatally 
infected rat brain RNA was named pBDV-gpl8. 
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In vitro transcription, translation, and eotranslational 
processing. 

Plasmid clones pBDV-gpl8 and pBDV-23 {Thibault, K. 
J., M.S. thesis, University of California, Irvine (1992)} 
5 linearized with EcoRl were used as templates for in vitro 
synthesis of capped RNA transcripts. Transcription 
products or Saccharomyces cerevisiae a -factor mRNA (control for 

glycosylation) were translated in vitro by using nuclease- 
treated rabbit reticulocyte lysates (Promega Corp., 

10 Madison, Wisconsin) in the presence of [ 3S S] methionine 
(Amersham Corporation, Arlington Heights, Illinois) . 
Cotranslational processing was assessed by in vitro 
translation using reticulocyte lysates supplemented with 
canine microsomal membranes (Promega, Madison, 

15 Wisconsin) . Transcription, translation, and 

cotranslational processing studies were performed 
according to the manufacturer's protocols. Translation 
products were immunoprecipitated with mouse anti-gpl8 
serum and then size fractionated by sodium dodecyl 

20 sulfate (SDS) -polyacryl amide gel electrophoresis (PAGE) 
(13% gel) {Laemmli, U. K. , et al., J.MoLBiol, 80:575-581 
(1973)} for autoradiographic analysis. Methods for 
immunoprecipitation and autoradiography have been 
described elsewhere {Lipkin, W. I., et al., Proc. Natl Acad 

25 ScLUSA, 87:4184-4188 (1990)}. 

Protein gel electrophoresis and imnunoblo t ting . 

Proteins were size fractionated by SDS-PAGE (12% 
gel) and then transferred to Immobilon-N membranes 

30 (Millipore Corp., Bedford, Massachusetts). Primary 
antisera for immunoblotting were from rats chronically 
infected with BDV (day 100 after intracranial infection) 
or mice immunized with purified gpl8. The secondary 
antibody was alkaline phosphatase -conjugated goat 

35 antimouse immunoglobulin G (Sigma Chemical Co., St. 
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Louis, Missouri) ; the substrate was Western Blue (Promega 
Corp., Madison, Wisconsin). 

Carbohydrate analysis. 

Purified protein was size fractionated by SDS-PAGE 
(13% gel) and then either silver stained for detection of 
protein or carbohydrate {Tsai, C. M., et al., Anal. Biochem. , 
119:115-119 (1982)} or transferred to Immobilon-N 
membranes (Millipore, Bedford, Massachusetts) for lectin 
staining. The carbohydrate composition of immobilized 
protein was determined by using a DIG Glycan 
Differentiation Kit (Boehringer Mannheim, Indianapolis, 
Indiana) and peroxidase -labeled Bandeiraea simplicifolia 
agglutinins I and II (BS-I and BS-II; Sigma Chemical Co., 
St. Louis, Missouri). The substrate for peroxidase was 
4-chloro-l-naphthol (Pierce Chemical Company, Rockford, 
Illinois) . Glycosidase digests of native and denatured 
protein (incubated for 5 minutes at 100°C in 0.01% SDS) 
were performed according to the manufacturer's protocols, 
using the following endoglycosidases : endoglycosidase F 
and 

tf-glycosidase F; O-glycosidase ; N- glycosidase F; 
endoglycosidase F, JV- glycosidase free; endoglycosidase H; 
and endo-0-galactosidase (Boehringer Mannheim) . 



25 



RESULTS 

The following figures present some of the results: 
FIG. 7. Sequence of ORF gpl8. The diagram shows 
the location of ORF gpl8 on the viral antigenome (5' -3') 
30 relative to ORFs p40 and P 23 (boxes) . ORF gpl8 sequences 
were from Oligo/TL cells infected with BDV strain V (SV) 
and rat brain infected with BDV He/80 (RB) . Peptide 
sequences (HI. P#2, and P#3) were obtained by 
microsequencing of purified protein from He/80 -infected 
35 rat brain. Periods indicate identical nucleotide or 
amino acid sequences. Variable amino acid residues 
(large asterisk) and stop codons (small asterisks) are 
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indicated. Underlining indicates potential glycosylation 
sites. 

FIG. 8. Glycan determination of gpl6. gp!8 
isolated from infected rat brain was size fractionated by 
5 SDS-PAGE (12% gel) then transferred to an Immobilon-N 
membrane for lectin staining (see Materials and Methods) . 
Lanes: 0, protein detection by mouse anti-gpl8 serum; 1, 
ConA; 2, wheat germ agglutinin; 3, D. stramonium agglutinin; 
4, BS-I; 5, BS-II; 6, G nivalis agglutinin; 7, S. nigra 

10 agglutinin; 8, KL amrensis agglutinin; 9, peanut agglutinin. 
Positions of molecular weight markers are shown in 
kilodaltons at the right. 

FIG. 9. gpl8 is sensitive to endoglycosidases . 
gpl8 isolated from infected rat brain was treated with 

15 either buffer alone or endoglycosidase . Protein was size 
fractionated by SDS-PAGE (13% gel) and detected by silver 
staining. Lanes: 1, buffer; 2, endoglycosidase F plus 
JV-glycosidase F; 3 f endoglycosidase F (JV-glycosidase 
free); 4, endo-0-galactosidase. Positions of molecular 

20 weight markers are shown in kilodaltons at the right. 

FIG. 10. In vitro transcription, translation, and 
cotranslational processing of gpl8 . RNA transcripts were 
synthesized from pBDV-23 (a nonglycosylated BDV protein 
control) or pBDV-gpl8 and translated in vitro by using 

25 rabbit reticulocyte lysates in either the absence or 
presence of canine microsomal membranes. [ 35 S] methionine- 
labeled translation products were immunoprecipitated with 
antisera to p23 or gpl8 and protein A-Sepharose and then 
size fractionated by SDS-PAGE (13% gel) for 

30 autoradiography (A) or transferred to Immobilon-N 
membranes for ConA lectin staining (B) . Translated gpl8 
in lane 5 of panel A and lane 3 of panel B was incubated 
with endoglycosidase F plus W-glycosidase F prior to SDS- 
PAGE. (A) Lanes: 1, pBDV-23 RNA; 2, pBDV-23 RNA plus 

35 microsomal membranes; 3, pBDV-gpl8 RNA; 4, pBDV-gp!8 RNA 
plus microsomal membranes; 5, pBDV-gp!8 RNA plus 
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microsomal membranes, incubated with endoglycosidases . 
The long arrow indicates the position of glycosylated 
protein (lanes 3 and 4); the short arrow indicates the 
position of protein after treatment with endoglycosidase 
5 F plus tf-glycosidase F (lane 5) . The asterisk indicates 
nonspecific background signal (lane 5) . Positions of 
molecular weight markers are shown in kilodaltons at the 
right. (B) Lanes: 1, pBDV-gpl8 RNA; 2, pBDV-gpl8 RNA 
plus microsomal membranes; 3, pBDV-gpl8 RNA plus 
10 microsomal membranes, incubated with endoglycosidases. 

Isolation of gpl8. 

Protein was isolated from neonatal ly infected rat 
brain, acutely infected rabbit fetal glial cells (two 

15 passages), persistently infected C6 cells, and 
persistently infected MDCK cells, using the method of 
Schadler et al. {Schadler, R., et al., J. Gen. Virol, 
66:2479-2484 (1985)}. The purity of the protein was 
confirmed by silver staining of the protein after SDS- 

20 PAGE (data not shown). The quantity of protein was 
estimated in silver- stained gels by using lysozyme 
standards. Typical yields were 5 ng of protein from one 
neonatally infected rat brain and 2 fig of protein from 10« 
infected cultured cells. Protein from neonatally 

25 infected rat brain was used for microsequencing, 
carbohydrate analysis, and immunization of mice. 

Protein and nucleic acid sequence analysis. 

Direct microsequencing of gpl8 was not possible 
30 because of a blocked amino terminus; thus, to allow 
analysis, the protein was cleaved with cyanogen bromide. 
Sequencing of the cleavage mixture indicated the presence 
of three N termini. From the mixture, two peptides 
(peptides 1 and 3; FIG. 7) were isolated by RP-HPLC and 
35 sequenced individually, allowing inference of a third 
sequence (peptide 2; FIG. 7) by subtraction. Peptide 
sequences were used as probes to search ORFs located on 
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the BDV antigenome. The peptide sequences obtained from 
the purified gpl8 mapped to a 429-nt ORF (ORF gpl8) on 
the viral antigenome that predicts a 142 -amino- acid 
protein with a molecular weight of 16,244 (FIG. 7) . 
5 Genomic sequence corresponding to the gpl8 ORF was 

used to design probes and primers for identifying mRNA 
encoding gpl8. In each of two cDNA libraries prepared 
from BDV- infected adult rat brain poly (A)* RNA {Lipkin, W. 
I., et al., Proc. Natl Acad Set USA , 87:4184-4188 (1990) and 

10 McClure, M. A. , et al., 1 Virol, 66:6572-6577 (1992)}, 
100,000 recombinants were screened by hybridization with 
a 271-bp HincII-Hinft restriction fragment from pTB-BDV 
5.82 (nt 2062 to 2333 in the viral genome) {Briese, T., 
et al., Proc. Natl Acad ScL USA 91:4362-4366 (1994)}. These 

15 libraries were also screened by PCR using the 5' -terminal 
Afe>I-gpl8 sense primer (nt 1892 to 1914) and oligo(dT) . 
Total cellular and poly (A)* RNAs extracted from 
persistently infected C6 cells, BDV- infected adult rat 
brain, or 3 -week-old neonatally infected rat brain (the 

20 peak time point for m vivo expression of gpl8) were 
subjected to RT-PCR using oligo(dT) in combination with 
the 5' -terminal A7i0l-gpl8 sense primer. No gpl8-specif ic 
transcript corresponding to the size of ORF gp!8 was 
obtained in these experiments. In contrast, use of the 

25 5' -terminal AftoI-gpl8 sense primer in combination with a 
3' -terminal gplB antisense primer (nt 2301 to 2321) 
allowed amplification of gpl8 sequences from any of these 
sources by RT-PCR. In spite of variability at the 
nucleic acid level, the predicted amino acid sequence 

30 obtained from the different sources was the same as for 
strain V genomic sequence, with the exception of a single 
exchange in position 108 (E-»D) (FIG. 7) . 

Characterization of gpl8 as a glycoprotein. 

35 Purified gpl8 was size fractionated by SDS-PAGE. 

Modified silver staining revealed the presence of 
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carbohydrate; thus, fractionated protein was blotted onto 
Immobilon-N membranes to determine the presence of 
individual saccharides through lectin binding studies. 
Binding was observed with Cancanavalia ensiformis agglutinin 
5 (ConA), wheat germ agglutinin, Datura stramonium agglutinin, 
BS-I, and BS-II but not with Galanthus nivalis agglutinin, 
Sambucus nigra agglutinin, Maackxa amurensis agglutinin, and 
peanut agglutinin (FIG. 8) . This staining pattern was 
consistent with the presence of tf-acetylglucosamine, N- 
10 acetylgalactosamine, mannose, and galactose. In 
addition, native and denatured proteins were digested 
with specific endoglycosidases, size fractionated by SDS- 
PAGE, and then stained to assess molecular weight shift 
and presence or absence of carbohydrate. Treatment with 
15 O-glycosidase or endoglycosidase H had no effect (data 
not shown) . In contrast, treatment with endoglycosidase 
F and 

tf-glycosidase F resulted in a loss of 1 to 2 kDa (FIG. 9) 
and abrogation of lectin staining with ConA (data not 
20 shown) . Treatment with endoglycosidase F (AT-glycosidase 
free) or endo-0-galactosidase also resulted in a loss of 
1 to 2 kDa (FIG. 9) . 

In vitro transcription, translation, and processing of gpl8. 

25 With linearized pBDV-gpl8 used as a template, gplB 

RNA was transcribed and translated in vitro in either the 
presence or absence of canine microsomal membranes. The 
gpl8 RNA directed translation of two proteins of 16 and 
18 kDa that were recognized by monospecific murine 

30 antiserum to purified gplB. Translation in the presence 
of microsomal membranes led to an increase in the 
relative proportion of the 18 -kDa protein. Treatment 
with endoglycosidase F resulted in loss of the 18 -kDa 
protein species (FIG. 10A) . Glycosylation of the 18-kDa 

35 species was also shown by lectin binding studies 
performed after translation products were size 
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fractionated by SDS-PAGE and transferred to membranes. 
The 18 -kDa protein was recognized by ConA, whereas the 
16 -kDa protein did not bind ConA (FIG. 10B) . 
Modification of translated protein by the microsomal 
5 membranes was specific for gpl8. Translation of RNA 
encoding BDV p23, which encodes a potential N- 
glycosylation site (amino acids 53 to 55) , included as a 
negative control for in vitro glycosylation, was not 
influenced by the presence of microsomal membranes (FIG. 
10 10A) . 

DISCUSSION 

We have isolated and partially characterized a BDV 
glycoprotein with unusual properties. This protein, 

15 previously reported as 14.5 kDa {SchSdler, R., et al., 
J. Gen Virol, 66:2479-2484 (1985)}, is 16.2 kDa prior to 
carbohydrate modification and -18 kDa after 
glycosylation. Though no classical sites for N linkage 
(N-x-S/T {Marshall, R. D., Armu. Rev. Biochem, 41:673-702 

20 (1972)}) are found in the gpl8 sequence, the protein is 
readily modified in vitro in the presence of a microsomal 
membrane system capable of N glycosylation {Gahmberg, C. 
G. r et al., p. 281-297, In S. Fleischer and B. Fleischer 
(ed.), Biomembranes , Academic Press, New York (1983) and 

25 Lau, J. T. Y. , et al., J. Biol. Chem., 258:15255-15260 
(1983)}. In addition, gplB is sensitive to AT-glycosidase 
F, an enzyme which cleaves between asparagine and N- 
acetylglucosamine {Plummer, T. H., et al., J. Biol Chem. , 
259:10700-10704 (1984) and Tarentino, A. L. , et al., 

30 Biochem., 24:4665-4671 (1985)}. These findings indicate 
that gp!8 is N glycosylated at a nonclassical site. One 
potential site is N-I-Y (amino acids 74 to 76) . The 
presence of a hydroxy 1 amino acid (T or S) or cysteine in 
position +2 (N-x-T/S or C) has been proposed as essential 

35 for hydrogen bond donor function in N glycosylation 
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{Bause, E. , et al.. Biochem. J, 195:639-644 (1981)}. It is 
possible that tyrosine (Y) , another hydroxyl amino acid 
in position +2, could serve as a hydrogen bond donor in 
gplfl. A second potential site for N glycosylation is L- 
5 N-S-L-S (amino acids 87 to 91) . which is similar to S-N- 
S-G-phosphorylated S, the site for N glycosylation in a 
glycopeptide from hen yolk phosvitin {Shainkin, R., et 
al., J Biol. Chem., 246:2278-2284 (1971)}. 

gplB is sensitive to endoglycosidase F, an enzyme 
10 that cleaves after the N-linked JV-acetylglucosamine in 
high mannose-, biantennary hybrid-, and biantennary 
complex-type oligosaccharides {Tarentino, A. L., et al.. 
Biochem., 24:4665-4671 (1985) and Tarentino, A. L. , et al . , 
Methods Enzymol. , 230:44-57 (1994)}. The protein is not 
15 sensitive to endoglycosidase H, an enzyme which cleaves 
after the N-linked Af-acetylglucosamine in high-mannose- 
and most hybrid-type oligosaccharides but does not cleave 
complex- type oligosaccharides {Trimbel, R.B., et al., Anal. 
Biochem., 141:515-522 (1984)}. Lectin staining using G. 
20 nivalis agglutinin shows no evidence of terminal mannose 
characteristic for hybrid- and high-mannose-type 
glycosylation. In contrast, staining with ConA (mannose, 
tf-acetylglucosamine, branched trimannosyl core) {Ogata, 
S., et al., J. Biochem., 78:687-696 (1975)}, wheat germ 
25 agglutinin (tf-acetylglucosamine) , and BS-II (terminal N- 
acetylglucosamine) {Ebisu, S., et al., Methods Enzymol. , 
50:350-354 (1978)} indicates the presence of terminal N- 
acetylglucosamine and internal mannose. Thus, there is 
evidence from the pattern of endoglycosidase sensitivity 
30 and lectin staining that gpl8 is likely to be a 
biantennary complex- type glycoprotein. 

gpl8 is sensitive to endo-0-galactosidase . This 
enzyme cleaves between galactose and either N- 
acetylglucosaraine or galactose when these saccharides 
35 occur in unbranched sequence {Scudder, P., et al., J.Biol. 
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Chem., 259:6586-6592 (1984)}. The presence of galactose 
was confirmed by BS-I lectin binding (FIG. 8) . The 
presence of both #-acetylglucosamine and galactose was 
confirmed by high-performance anion-exchange 
5 chromatography with pulsed amperometric detection. The 
combination of acetylgalactosamine and galactose is 
usually found in O-linked carbohydrates {Hayes, B. K. , et 
al., 1 Biol Chem. 268:16170-16178 (1993)}. Though it is 
possible that gpl8 is both N and O glycosylated, N- 

10 acetylgalactosamine has also been reported to occur in 
complex-type N-linked glycosylation {Hayes, B. K. , et 
al., J. Biol Chem. 268:16170-16178 (1993)}. 

We did not detect a monocistronic ~429-nt mRNA for 
gpl8 by PCR using oligo(dT), a 5' sense primer, and 

15 template from a variety of sources, including infected 
cell lines and rat brain. In contrast, a 429-nt gpl8 
cDNA was readily amplified by using gene-specific primers 
and total RNA or poly (A)* RNA as a template. Northern 
(RNA) hybridization experiments with gpl8-specif ic probes 

20 using total RNA or poly (A)* RNA from infected cells or rat 
brain detected only 1.5-, 2.8-, 3.5-, 6.1-, and 7.1-kb 
transcripts. Recent experiments confirmed that the 1.5- 
and 2.8-kb RNAs can serve as templates for in vitro 
translation of the gpl8 (data not shown) . These data 

25 suggest that gp!8 is likely to be translated from one or 
more of the larger RNA transcripts . 

The role of gpl8 in the BDV life cycle remains to be 
determined. Though the virus has not been characterized 
morphologically, genetic analysis has characterized BDV 

30 as a member of the order Mononegavirales {Briese, T., et al., 
Proc. Natl Acad. Sci USA 91:4362-4366 (1994) and Cubitt, B., et 
al., J.Virol, 68:1382-1996 (1994)}. In nonsegmented, 
negative -strand RNA viruses, the third gene usually 
directs expression of a matrix protein. Matrix proteins 

35 in members of the order Mononegavirales are not known to be 
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glycosylated; however, glycosylated matrix proteins that 
resemble gpl8 in size and pi (-10) have been found in 
other viral systems (e.g.. El in coronaviruses {Armstrong, 
j., et al., Nature (London), 308:751-752 (1984)}). 
5 Preliminary observations suggest that gpl8 is present on 
the surface of the viral envelope. Monospecific antisera 
and monoclonal antibodies to gpl8 precipitated viral 
particles and had neutralizing activity. In contrast, 
antibodies to p40 and P 23 did not precipitate viral 

10 particles or neutralize infectivity (see Example 4 
below). Preincubation of primary rabbit fetal glia 
(cells highly susceptible to BDV) with gpl8 prevented 
infection. No such effect was observed with either p40 
or p23. Last, gpl8 and BDV particles compete for binding 

15 to a -100-kDa membrane protein present in cells 
susceptible to infection. 

ExpT-aBBi"" , Rec ombinant P57_ 

cDNAs representing the p57 ORF were amplified by RT- 
20 PCR using BDV (strain He/80) -rat brain RNA as template. 
The amplified p57 cDNA was subcloned into two plasmid 
vectors, pET21b (Novagen) and pSFV-1 (GIBCO BRL) . 

pET21b, a prokaryotic expression vector, was 
selected because it allows for tight control of protein 
25 expression, an important feature for expression of 
proteins toxic to host cells. The N- terminus of p57 
contains a hydrophobic sequence that confers extreme 
toxicity to prokaryotic cells. Therefore, to facilitate 
the expression of p57, the first 152 N- terminal amino 
30 acids were excluded during the cloning. PCR amplified 
cDNA representing nucleotides 2697 to 3743 of P 57 ORF 
(amino acids 153 to 503) was generated by using 
oligonucleotide primers designed with a 5' restriction 
site (BamHl for sense primer; Xhol for antisense primer) . 
35 The PCR product was cloned into pET21b at the BamHl and 
Xhol restriction sites, thus generating pET21b-BDV57 153 . 5 o 3 . 
The pET21b-BDV57 153 _ 503 plasmid was transformed into BL21 
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host cells and recombinant protein was expressed and 
purified by using protocols provided by the manufacturer. 

An eukaryotic expression system, which allows for 
posttranslational modification, was selected for the 
5 expression of a recombinant protein more similar to 
native p57. pSFV-1 is a eukaryotic expression vector 
that can be used to generate a replication defective 
Semliki Forest virus (SFV) genomic RNA. The entire p57 
0RF was PCR amplified and cloned into pSFV-1 prepared 
10 with 3' T-overhangs at the Smal site, thus generating 
pSFV-BDV57. Transfection of pSFV-BDV57 transcripts into 
mammalian cells, results in overexpression of the 
posttranslationally processed p57 gene product. 

15 EXAMPLE 3 

ELIS A for the Detection of Antibodies to Borna Disease 
Virus Proteins 

We have expressed p40, p23 and gpl8 as recombinant 
proteins and established a sensitive, specific ELISA for 

20 analyzing immunoreactivity to BDV. This assay system is 
more sensitive and rapid than methods currently employed 
for serologic diagnosis of infection such as Western 
blot, indirect immunof luorescent test (IFT) or 
immunoprecipitation . 

25 This system provides a convenient tool for 

diagnosing disease, determining the prevalence of 
infection in animal and human populations and mapping the 
antigenic determinants for the immune response in 
infected hosts; 

30 

MATERIALS AND METHODS 

Infection of Animals and Cultured Cells. 

Six week old Lewis rats (Charles River) were 
infected intranasally with 6 x 10 4 focus forming units 
35 (ffu) of BDV strain He/80-1 {Carbone, K., et al., J. Virol , 
61:3431-3440 (1987) and Schneider, P. A., et al., J. Virol, 
68:63-68 (1994)}. C6 cells were persistently infected 
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with BDV He/80-1 (C6BDV) {Carbone, K. M. , et al., J. Virol, 
67:1453-1460 (1993)}. Rabbit fetal glial cells were 
infected with BDV He/80-1 at a multiplicity of one ffu 
per cell then passaged once before use in I FT assays. 
5 BDV strain He/80 was originally isolated from infected 
horse brain, passaged twice in rabbits, three times in 
rabbit fetal glial cells, and twice in Lewis rats 
{Herzog, S., et al., Med. Microbiol Immunol, 168:153-158 
(1980)}. He/80-1 was passaged four additional times in 
10 Lewis rats and used for infection of animals and cell 
lines . 

Generation of Recombinant Proteins (recp40, recp23, and 
recpl8) 

15 Full length cDNAs encoding p40, p23 or gpl8 were 

cloned into the prokaryotic expression vector P ET15b 
(Novagen) for production of recombinant proteins. pBDV- 
40 in pcDNA II {McClure, M. A. , et al., J- Virol, 66:6572- 
6577 (1992)} was amplified using the primers p40Xho I 
20 (5'- CCCTCGAGGACCAAGATTT-3 ' ) (SEQ ID NO: 17) and Sp6 
(20mer, Promega Corp., Madison, Wisconsin). pBDV-23 in 
pBluescript SKII+ {Thibault, K. J., M.S. thesis, 
University of California, Irvine (1992)} was amplified 
with the primers P 24Nde I (5'- AGAATCATATGGCAACGCGACCATC- 
25 3') (SEQ ID NO: 18) and T7 (20mer Promega). Polymerase 
chain reaction was performed using Taq polymerase 
(Perkin-Elmer Cetus Corp., Norfolk, Connecticut) 
according to the manufacturer's protocol. Products 
amplified from pBDV-40 and pBDV-23 were phenol /chloroform 
30 extracted, precipitated and digested with BamH I and 
either Xho I (pBDV-40) or Nde I (pBDV-23) (Promega Corp., 
Madison, Wisconsin) . pBDV-cpl8 in pBluescript SKII+ (see 
Example 2 above) was digested with Xho I and BamH I. 
Digested fragments were purified by agarose gel 
35 electrophoresis (USB, USBioclean, Cleveland, Ohio) and 
cloned into pET15b (Novagen Corporation, Madison, 
Wisconsin). Protein expression in plasmid containing 
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Escherichia coli cells was induced by addition of isopropyl-0- 
thiogalactopyranoside (1 mM) for 3 hours at 37°C. 
Proteins (recp40, recp23 # and recpl8) were purified by 
nickel -chelate affinity chromatography according to 
5 manufacturer's instructions (Novagen Corp.). 
Purification was assessed by SDS-PAGE and antigenicity 
was confirmed by Western blot using sera from infected 
rats. Proteins were dialyzed against 150 mM NaCl and 2.5 
mM CaCl 2 and digested with biotinylated thrombin (1 

10 unit/mg recombinant protein, Novagen Corp.) overnight at 
room temperature. Thrombin was removed using 

streptavidin- agarose {Novagen Corp.) according to 
manufacturer's protocol. Protein concentrations were 
estimated by BioRad protein assay according to 

15 manufacturer's instructions. 

Antibodies to BDV and recombinant BDVproteine 

Sera were collected from infected rats at time of 
sacrifice or by tail bleeding at 2 -week intervals after 

20 inoculation with BDV. Antibodies to recp40 and recp23 
were each produced in two rabbits. Animals were injected 
subcutaneous ly (s.c.) with 25 jig of protein in Freund's 
complete adjuvant and then boosted 3 weeks later s.c. 
with 25 fig of protein in Freund's incomplete adjuvant. 

25 After 6 weeks some animals received an additional s.c. 
injection of 25 fig protein in Freund's incomplete 
adjuvant. Blood was collected at 2 -week intervals during 
weeks 7 through 14 for detection of antibodies by Western 
blot and ELISA. 

30 

Indirect Ixmrtunof luorescent Test (IFT) 

Rabbit fetal glial cells were processed for 
titration of serum antibodies against BDV using the 
immunohistochemical methods of Pauli et al. {Pauli, G., 
35 et al., Zbl. Vet. Med [BJ 31:552-557 (1984)}. Briefly, 
infected and noninfected cells were fixed with 4% 
formaldehyde in PBS, permeabilized with 1% Triton X-100 
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in PBS and blocked with 1% fetal bovine serum (PBS) in 
PBS. After incubation with sera diluted in 1% FBS in 
PBS, cells were incubated with fluorescein- conjugated 
goat anti-rat IgG and IgM or goat anti-rabbit IgG (Sigma 
Chemical Co., St. Louis, Missouri) diluted 1:200 in 1% 
FBS in PBS and then examined by fluorescent microscopy. 
The IFT titer for each serum was determined to be the 
endpoint dilution at which specific inununof lourescence 
was detected. 

Sodium dodecyl sulfate polyacrylamide gel electrophoresis 
(SDS-PAGE), Western blot (WB) and Imnrunoprecipitation 
(IP) 

For WB, lysates from infected and noninfected C6 
cells were prepared according to Bause-Niedrig, et al. 
{Bause-Niedrig, I., M. et al., Vet. Immunol. Immunopathol . , 
31:361-369 (1992)}. Proteins from these lysates (30 /ig) 
and recombinant BDV proteins (250 ng) were subjected to 
12% SDS-PAGE {Laemmli, U. K. , et al., J. Mol. Biol. , 80:575- 
581 (1973)} and then transferred to nitrocellulose 
membranes (Schleicher & Schuell, Keene, New Hampshire) 
{Towbin, H., et al . , Proc. Natl. Acad. Sci. USA, 76:4350-4354 
(1979)}. Membranes were incubated at room temperature 
first with WB-diluent (0.5% nonfat dry milk (Carnation 
Company, Los Angeles, California) and 0.05% Tween-20 
(Fisher Scientific, Raleigh, North Carolina) in TBS (tris 
balanced saline, 50 mM Tris-HCl pH 7.5 and 150 mM NaCD) 
for one hour, then overnight with various dilutions (1:10 
to 1:2,000) of rat sera or monospecific rabbit sera in 
WB-diluent. Membranes were washed 3 times in TBS, 
incubated for 2 hours with the appropriate secondary 
antibody (horseradish peroxidase -conjugated goat anti-rat 
IgG and IgM or goat anti-rabbit IgG, Sigma Chemical Co., 
St. Louis, Missouri) diluted 1:500 in WB-diluent, washed 
> 5 times in TBS and then incubated with hydrogen peroxide 
and 4-chloro-l-naphthol (Pierce Chemical Company, 
Rockford, Illinois) according to manufacturer's 
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instructions. Methods for synthesis and analysis of 
radiolabeled BDV proteins and iminunoprecipitation have 
been described {Lipkin, W. I. f et al., Proc. Natl Acad ScL, 
USA, 87:4184-4188 (1990)}. Briefly, plasmid clones pBDV- 
5 gpl8, pBDV-23 and pBDV-40 were linearized and used as 
template for in vitro transcription and translation of [ 35 S] 
methionine -labeled proteins. After precipitation with 
rat or rabbit sera and protein A-sepharose (Sigma 
Chemical Co., St. Louis, Missouri), proteins were 
10 analyzed by SDS - PAGE and autoradiography. 

ELISA 

Ninety- six well, Immulon I microtiter plates with 
lids (Dynatech Laboratories , Chantilly, Virginia) were 

15 coated overnight at 37°C with 10 ng of recombinant 
protein per well in 100 /il of borate buffer (100 mM boric 
acid, 50 mM sodium borate and 75 mM sodium chloride, pH 
8.4) . Plates were washed three times with washing buffer 
(0.05% Tween-20 in PBS) and incubated for 1 hour at 37°C 

20 with ELISA-diluent (0.5% bovine serum albumin (BSA) 
fraction V (USB) in washing buffer) . Two- fold serial 
dilutions of sera were prepared in ELISA-diluent; 100 /il 
of sera diluted from 1:250 to 1:500,000 was then added to 
each well and incubated for 2 hours at 37°C. Plates were 

25 washed three times with washing buffer. Next, 100 /xl of 
horseradish peroxidase -conjugated goat ant i- rat IgG and 
IgM (Sigma Chemical Co., St. Louis, Missouri) diluted 
1:5,000 in ELISA-diluent were added to each well and 
incubated for 1 hour at 37°C. After washing the plates 

30 five times, 100 /il of substrate solution was added to 
each well. Substrate solution consisted of 9.9 ml of 100 
mM sodium acetate adjusted to pH 6.0 with 100 mM citric 
acid, 100 /zl of 10 mg of 3 , 3 ' , 5, 5 ' -tetramethylbenzidine 
(Sigma Chemical Co., St. Louis, Misouri) per ml in 

35 dimethyl sulfoxide and 1.5 fil of 30% hydrogen peroxide 
(Fisher Scientific, Raleigh, North Carolina) . After 
incubation in the dark at room temperature for 30 
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minutes, the reaction was stopped by the addition of 50 
M l of 25% sulphuric acid (Sigma Chemical Co., St. Louis, 
Missouri) to each well. The absorbance at 450 nro was 
determined for each well using a microplate reader 
(Molecular Devices, Thermo max, Menlo Park, California) . 
Negative control wells, without primary antisera, were 
used for calibration. The ELISA titer for each serum was 
defined as the endpoint dilution that yielded an optical 
density of 0.3. 



10 



20 



RESULTS 

The figures below present some of the results: 
FIG. 11. Western blot analysis of native and 
recombinant proteins with monospecific antisera to 
15 recombinant proteins and sera from infected rats. 
Recombinant viral proteins and lysates from infected 
C6BDV or noninfected C6BDV cells were size-fractionated 
and screened by Western blot. A) Sera from infected and 
noninfected rats were used to detect native or 
recombinant proteins. Lane 1, C6BDV lysate; lane 2, 
recp40; lane 3, recp23; lane 4, recplS; lane 5, C6 
lysate; lane 6, recp40, recp23 and recplB. Lanes 1-4 
were treated with serum from infected rat; lanes 5 and 6 
were treated with serum from noninfected rat. B) 
Monospecific antisera were used to detect BDV-specific 
proteins. C6BDV lysates (lanes 1-3) and C6 lysates 
(lanes 4 and 5) were incubated with: lanes 1 and 4, serum 
from infected rat; lane 2, anti- P 40 rabbit serum; lane 3, 
anti-p23 rabbit serum; and lane 5, pooled anti-p40 and 

30 anti-p23 sera. 

FIG. 12. ELISA of infected rat serum reacted with 
recp40. ELISA was performed with 10 ng/well recp40 or 
BSA as described in Materials and Methods. Circles, 
recp40 and serum from chronically infected rat; squares, 
recp40 and serum from noninfected rat; triangles, BSA and 
serum from chronically infected rat. 
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FIG. 13. Timecourse for appearance of antibodies to 
BDV-proteins. Sera were collected at different times 
post -infect ion and assayed by EL ISA for antibodies to (A) 
recp40; (B) recp23; and (C) recpl8. Error bars represent 
5 standard error of the mean. Number of animals analyzed 
at each time point: <4 wks, 15; 5 wks, 6; 6 wks, 12; 8 
wks # 4; 10 wks, 5; and 15 wks, 9. 

Production of recombinant viral proteins and monospecific 
10 antisera to recombinant viral proteins 

Pull length coding sequences for p40, p23 and gpl8 
were expressed in Escherichia coli and recombinant proteins 
were purified. The yield of protein in 100 ml of 
bacterial culture was: recp40 # 1 mg; recp23, 500 fig; and 

15 recp!8, 50 ng. Recombinant proteins were analyzed by 
SDS-PAGE. A predominant band of the expected molecular 
weight was observed for each protein and tested for 
antigenicity by WB using sera from BDV- infected and 
noninfected rats (FIG. 11A) . Recombinant proteins were 

20 detected by sera from BDV- infected rats but not by sera 
from noninfected rats. Recombinant proteins, recp40 and 
recp23 were used to produce antibodies in rabbits. The 
production of antibodies was monitored by ELISA. Rabbits 
were sacrificed when the ELISA titer reached 1:500,000 

25 (week 16 of immunization) . The specificity of the 
antisera was then tested by WB using lysates from 
infected cells and recombinant proteins (FIG. 11B) . 
Antisera were monospecific: rabbits immunized with 
recp40 produced antibodies that reacted only with p40 and 

30 recp40; rabbits immunized with recp23 produced antibodies 
that reacted only with p23 and recp23. At week 16 of 
immunization, the antisera were also titered by I FT. 
Antisera to recp40 and recp23 had I FT titers of 1:50,000 
and 1:100,000, respectively. 



35 
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Specificity and sensitivity demonstrated in the BDV-ELISA 
systems 

In order to establish a sensitive and specific ELISA 
for all three recombinant BDV proteins, the optimal 
5 antigen concentration was determined by checkerboard 
titration of positive and negative sera versus various 
antigen concentrations. For each protein, the 

concentration that resulted in the most linear response 
was 10 ng/well. The sensitivity of the ELISA system for 

10 each recombinant protein was established using sera from 
infected rats known to be reactive by IFT, IP and WB. 
For each of the proteins, 100% of sera that had been 
found to be positive by other methods were also positive 
by ELISA. Specificity was tested using sera from 15 

15 noninfected rats. ELISA for each protein proved to be 
highly specific for detection of antibodies to BDV 
proteins: recp40 -ELISA with noninfected rat sera showed 
80% specificity at 1:500 dilution or 100% specificity at 
1:2,000, recp23-ELISA showed 93% specificity at 1:250 and 

20 100% specificity at 1:1,000. recpl8-ELISA showed 100% 
specificity at 1:250. Figure 12 shows a representative 
ELISA using recp40 as target antigen. Various dilutions 
of sera from chronically infected and noninfected rats 
were tested with 10 ng of recombinant protein or BSA per 

25 well in comparison with BSA. No nonspecific background 
reactivity was observed at serum dilutions of 1:500 or 
higher (FIG. 12) . Results were similar when recp23 and 
recpl8 were used as target antigen. 

30 Analysis of iamunoreactivity to viral proteins bv IFT WB, 
IP and ELISA in Bera from infected rats 

Adult rats infected intranasally with BDV did not 
display abnormal behaviors prior to the fourth week post- 
infection (predisease, PD) . Four to six weeks post- 
35 infection, in the acute phase of disease (AD) , animals 
had hyperactivity, weight loss, disheveled fur, dystonic 
posture and hindlimb paresis. Eight to fifteen weeks 
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post -infection, signs of disease sterilized: there was 
no additional weight loss, hyperactivity diminished and 
paresis did not progress. This chronic phase of the 
disease (CD) persisted for the life of the animals. Sera 
5 was collected from adult -infected rats between 3 and 15 
weeks after infection with BDV, and analyzed for the 
presence of antibodies to viral proteins using four 
different methods: IFT, WB, IP and EL ISA (Table 3). 
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I FT allowed detection of antibodies to BDV in both 
AD rats and CD rats. In AD rats, the titer was between 
1:20 and 1:200, whereas in CD rats, the titer was between 
1:10,000 and 1:20,000. Sera from PD rats were not 
5 reactive by IFT. WB using lysates from infected cells or 
recombinant proteins, and IP using proteins translated in 
vitro yielded identical results: sera from CD animals were 
reactive with p40, p23 and gpl8; sera from AD rats 
detected only p40 and p23; sera from PD rats did not 

10 react with p40, p23 or gpl8. ELISA detected antibodies 
reactive with p40, p23 and gpl8 in sera from all CD and 
AD rats (Table 3) . In PD rats, ELISA only detected 
antibodies reactive with p40 and p23; immunoreactivity 
with gpl8 was below specificity (Table 3) . 

15 The timecourse for the appearance of antibodies to 

BDV-proteins in sera was determined by ELISA. Sera 
collected at regular intervals from adult -infected rats 
were tested in the recp40, recp23 and recpl8 ELISA 
systems. Titers of antibodies to all three proteins 

20 increased throughout the period of observation from weeks 
4 to 15 post infection (FIG. 13) . 

DISCUSSION 

Three recombinant BDV proteins, recp40, recp23 and 
25 recp!8, were expressed and used as immunogens for 
production of monospecific sera in rabbits. Two of these 
antisera, directed against recp40 and recp23, are 
reported here; antisera to recp!8 are described in 
Example 4 below. These three recombinant proteins were 
30 detected by sera from infected rats (FIG. 11A) and by 
monoclonal antibodies to purified native proteins. 
Monospecific antisera to the recombinant proteins were 
immunogen-specific as determined by WB (FIG. 11B) and 
detected proteins in infected cells by IFT. 
35 ELISA systems were established, based on recombinant 

proteins, that have several advantages over methods 
currently used for detection of BDV-specific antibodies 
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including IFT, WB and IP. Although IFT is widely 
accepted as a method for diagnosing BDV infection and 
titering antibodies to the virus, it has two 
disdavantages. First, IFT does not define the viral 
protein (s) responsible for immunoreactivity . Second, as 
shown here, IFT titers are 10-100 fold less sensitive 
than ELISA for detection of antibodies to p40 or p23. 
This relative insensitivity resulted in failure of IFT to 
show evidence of infection in PD rats (Table 3) . WB and 
IP allowed detection of antibodies to individual viral 
proteins but were also less sensitive than ELISA. Sera 
from PD rats were not reactive by either WB or IP. 

For diagnostic purposes, the recp40-ELISA is the 
most sensitive method for detection of antibodies in 
infected animals. Antibodies to recp40 were present 
prior to disease onset and had higher titers than 
antibodies to recp23 or recplS. Although the recp23- 
ELISA was also positive in PD and AD rats, the recplS- 
ELISA was not. Because high titer antibodies to gplB 
only appear in chronic disease, the recplS -ELISA may be 
used to estimate the duration of infection. Low antibody 
titers to recplS are not due to the lack of glycosylation 
on this recombinant protein because similar ELISA titers 
were found with native gplS antigen. Failure to produce 
high titer antibody response to recplS may be due to 
lower levels of expression of this protein than p40 or 
p23. 

Growing recognition that BDV has a broader species 
and geographic range than previously appreciated suggests 
the importance of designing sensitive, reliable assays 
for infection. The ELISA systems described here, provxde 
inexpensive, rapid methods for BDV-serology. In contrast 
to IFT, WB and IP, which require at least 2 days for 
completion and are not well suited to screening multiple 
samples, ELISA allows analysis of hundreds of sera in 
several hours with only minimal equipment. Plates coated 
with these proteins have been stable in ELISA for up to 
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one month at room temperature and thus are practical for 
use in remote laboratories. In addition to serving as a 
tool for clinical diagnosis and epidemiology of Borna 
disease infection, the BDV ELISA is a useful tool for 
5 studies in immunopathogenesis and virus biology. For 
example, applicants have mapped antigen binding sites on 
p40 and p23 by ELISA using sera from infected animals and 
monoclonal antibodies to BDV proteins. 

Dependent on the population studied and the methods 

10 used for analysis {WB, IP or I FT) , the prevalence of 
antibodies reactive with BDV proteins in patients with 
neuropsychiatric disorders has been estimated to be 
between 4% and 23% {Bode, L. , In W. I. Lipkin and H. 
Koprowski (ed.), Borna Disease. Springer -Verlag, 

15 Heidelberg, in press (1995)}. Variability between 
laboratories could be due to differences in populations 
analyzed, antigen preparations or experimental technique. 
The BDV ELISA based on recombinant proteins provides a 
standardized method for investigating human 

20 immunoreactivity to this neurotropic infectious agent. 

EXAMPfrg 4 

Neutralizing An tibodies in BDV Infected Animal b 

We examined the timecourse for the development of 
25 neutralization activity and the expression of antibodies 
to individual BDV viral proteins in sera of infected 
rats. The appearance of neutralizing activity correlated 
with the development of immunoreactivity to gpl8, but not 
p40 or p23. Monospecific and monoclonal antibodies to 
30 native gpl8 and recombinant non-glycosylated gpl8 were 
also found to have neutralizing activity and to 
immunoprecipitate viral particles or subparticles. These 
findings suggest that gpl8 is likely to be present on the 
surface of the viral particles and to contain epitopes 
35 important for virus neutralization. 

Antibodies to p40 and p23 (soluble antigens) are 
readily detected in both sera and cerebrospinal fluid 
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(CSF) of naturally and experimentally infected animals 
{Ludwig. H. , et al., Progr. Med Virol. , 35:107-151 (1988); 
Ludwig, H., et al., Arch. Virol. 55:209-223 (1977) and 
Ludwig, H., et al., Med Microbiol. Immunol, 163:215-226 
5 (1977)}. Antibodies to gpl8, a membrane -associated 
glycoprotein (previously described as 14.5 kDa) , have 
been reported less frequently {Ludwig, H., et al., Progr. 
Med Virol, 35:107-151 (1988) and Rubin, S. A., et al., J. 
Virol, 67:548-52 (1993)}. Although neutralization 
10 activity has been found in sera of animals infected with 
BDV {Banner, K. , et al., Zbl Vet. -Med [B] , 25:345-355 (1978); 
Hirano, N. , et al . , J. Gen. Virol. . 64:1521-1530 (1983); 
Ludwig, H., et al., Progr. Med Virol. , 35:107-151 (1988) and 
Ludwig, H., et al., Arch. Virol. [Suppl] 7 :lll-133 (1993)}, the 
15 antibodies responsible for neutralization activity have 
not been investigated. An enzyme-linked immunosorbent 
assay (ELISA) based on recombinant BDV proteins has been 
established in Example 3 above, that provides a sensitive 
method for detection of antibodies to gpl8. We find that 
20 the appearance of neutralizing antibodies in infected 
rats correlates with immunological reactivity to gpl8 . 
Furthermore, monospecific and monoclonal antibodies 
(MAbs) directed against gpl8 neutralize BDV infectivity 
and immunoprecipitate viral particles or subparticles. 



25 



MATERIALS AND METHODS 

BDV infected animals: Sixty- thousand focus forming 
units (ffu) of BDV strain He/80-1 {Carbone, K. M. , et 
al., J. Virol, 61:3431-3440 (1987); Herzog, S., et al., Med 
30 Microbiol Immunol. . 168:158-8 (1980) and Schneider, P. A., et 
al., J.Virol., 68:63-68 (1994)} were used to intranasally 
(in) infect each of seventy 6 -week old Lewis rats. 
Rats were observed at three days intervals for weight 
loss, ruffled fur or postural abnormalities consistent 
35 with' acute disease. Sera were collected at time of 
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sacrifice. Under metofane anesthesia, rats were perfused 
with buffered 4% paraformaldehyde; brains were fixed 
overnight in perfusate at 4°C. Twenty -micron sagittal 
sections were collected onto gelatin coated slides and 
. 5 stained with hematoxylin and eosin. Inflammation was 
scored using the scale of Stitz, Sobbe and Bilzer {Stitz, 
L. , et al., J.Virol, 66:3316-23 (1992)}. 

Virus titration and neutralization assay. 

10 Viral infectivity in 20% brain homogenates was 

determined using the method of Pauli et al. {Pauli, G., 
et al., Zbl Vet.-MetL [B] 31:552-557 (1984)}. Virus 
neutralization was performed using a modification of 
Danner et al. {Danner, K., et al., Zbl Vet.-MedfBJ, 25:345- 

15 355 (1978)}. Briefly, 50 ffu of BDV were incubated with 
serial dilutions of antibodies or sera for one hour at 
37°C, added to rabbit fetal glial cells and incubated for 
5 days. Sera was heat inactivated at 56 °C for 30 
minutes. In selected assays, mouse complement (1:50) 

20 (Sigma Chemical Co., St. Louis, Missouri) was added to 
the virus concurrent with the addition of MAbs to 
determine the effects of complement on neutralization 
activity. The dilution of serum or antibody required to 
reduce the number of ffu by 50% was defined as the 

25 neutralization titer (NT 50 ) . As controls for each 
neutralization assay, rabbit fetal cells were exposed to 
medium without virus, treated with virus in medium alone 
(no antibodies) , or treated with virus incubated with 
sera from normal rats. Pilot studies showed that 

30 approximately 8% of normal rat sera interfered with BDV 
infectivity at dilutions up to 1:16. Therefore, sera 
were considered to be neutralizing only if the NT 50 
exceeded 1:32. Supernatant from nonproducing myeloma 
cell lines as well as monoclonal antibodies directed 

35 against BDV-p23 (24/36F1) and BDV-p40 (38/17C1) 
{Thiedemann, N., et al., J. Gen. Virol. , 73:1057-1064 (1992)} 
were found to neutralize infectivity at dilutions of 1:2. 
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Thus, monoclonal antibodies were considered to be 
neutralizing only if the NT50 exceeded 1x4. 

Preparation of proteins <recp40, recp23, recplS and 
5 gpl8) t 

Plasmids encoding P 40 {pBDV-40 disclosed in McClure, 
M . a., et al., J.ViroL, 66:6572-6577 (1992)}, P 23 {pBDV-23 
disclosed in Thibault, K. J., M.S. thesis; University of 
California. Irvine (1992) } and gpl8 { P BDV-gpl8 disclosed 
10 in Kliche, S. et al. J. Virol, 68:6918-6923 and Example 2 
above) were subcloned (see Example 3 above) into the 
prokaryotic expression vector petlSb (Novagen, Madison, 
Wisconsin). Recombinant proteins (recp40, rec P 23 and 
recplS) were expressed in Escherichia coli and purified 

15 according to manufacturer's protocol (Novagen, Madison, 
Wisconsin) . Purity and antigenicity were assessed by 
SDS-PAGE and Western blot analysis using sera from 
infected rats. Native, glycosylated gpl8 was prepared 
from infected rat brain as described previously 

20 {Schadler, R., et al., J. Gen. Virol, 66:2479-2484 (1985)}. 

Enzyme -linked immunosorbent assav (ELISA) : 

ELISA was performed as described in Example 3 above. 
Briefly, plates coated with recombinant protein were 

25 incubated with serially diluted sera or MAbs. Bound 
horseradish peroxidase (HRPO) -coupled secondary antibody 
(goat anti-mouse F'ab-HRPO, goat anti-rat igO and IgM 
HRPO; Sigma Chemical Co.) was quantified on a microplate 
reader (Thermo max, Molecular Devices, Menlo Park, 

30 California) using the chromagen 3.3'-5,5' 
Tetramethylbenzidine (Sigma Chemical Co.). The ELISA 
endpoint titer was defined as the serum or antibody 
dilution that generated an optical density of 0.3. 



35 
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Sodium dodecyl sulfate polyacrylamlde gel electrophoresis 
(SDS-PAGE) , Western blot and Immunoprecipitation (IP) : 

Recombinant or native BDV proteins were subjected to 
SDS-PAGE {Laemmli, U. K. , et al. # J.MoLBioL, 80:575-581 
5 (1973) } and transferred to nitrocellulose (Schleicher & 
Schuell, Inc., Keene, New Hampshire) or Imraobilon-N 
membranes (Millipore Corp., Bedford, Maryland) {Towbin, 
H., et al., Proc. Natl Acad Sci. USA, 76:4350-4354 (1979)}. 
Membranes were blocked and incubated with primary 

10 antibody as described in Example 3 above. After 
incubation with secondary antibody (goat antimouse IgG- 
alkaline phosphatase [AP] , goat anti-rat IgG-AP or goat 
anti-rat IgG and IgM-HRPO, Sigma Chemical Co., St. Louis, 
Missouri) , immune complexes were visualized using Western 

15 Blue (Promega, Madison, Wisconsin) for AP or 
chemiluminescence (ECL kit, Amersham, Arlington Heights, 
Illinois) for HRPO according to manufacturer's 
instructions. gp!8 or recpl8 were precipitated using 
sera from infected rats, monospecific antibodies or MAbs 

20 and Protein A-Sepharose (Pharmacia Biotech Inc., 
Piscataway, New Jersey) as described by Persson, H., et 
al. {Persson, H., et al. Science, 225:687-693 (1984)} then 
assayed by Western blot. 

25 Monoclonal antibodies: 

MAbs to gpl8 were generated according to Thiedemann 
et al. {Thiedemann, N. , et al., J. Gen Virol, 73:1057-1064 
(1992)}. Briefly, Balb/c mice were immunized 

intraperitoneally (i.p.) with 5 ng of gpl8 in complete 

30 Freund's adjuvant. Three and 6 weeks after the initial 
immunization, mice were boosted i.p. with 5 fig of gpl8 in 
incomplete Freund's adjuvant. Four days before fusion of 
spleen cells with the mouse myeloma cells X63-Ag8.653 
{Kearney, J. F., et al., J.Immunol, 123:1548-1550 (1979)}, 

35 mice were boosted intravenously with 15 fig of gpl8. All 
hybridomas were initially screened for reactivity to gpl8 
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by ELISA. Tissue culture supernatants from positive 
hybridomas were concentrated by ammonium sulfate 
precipitation {Jonak, Z. L., p. 405-406, In R. H. Kennett, 
T. J. McKean, and K. B. Becktol (ed.>, -Monoclonal 
5 antibodies, Hybridomas: A new dimension in biological 
analyses". Plenum Press, New York (1982)} and tested by 
Western blot and IP for reactivity with gpl8 and recplS. 
The immunoglobulin isotype was determined using an 
agglutination isotyping kit (Serotec, Oxford, England) 
10 according to manufacturer's instructions. Monoclonal 
antibody, 24/36F1 directed against BDV-p23 (Thiedemann, 
N., et al.. J. Gen. Virol. , 73:1057-1064 (1992)}, was used as 
a negative control in Western blot and IP experiments. 

15 Generation of polyclonal sera against recplS protein: 

To produce antibodies against recplS, two 2 -month 
old Lewis rats were injected subcutaneously (s.c.) with 
25 M9 of protein in Freund' s complete adjuvant and 
boosted 3 weeks later with 25 M 9 of protein s.c. in 

20 Freund' s incomplete adjuvant. After 6 weeks, animals 
received i.p. injections of 25 jig protein in phosphate 
buffered saline (PBS) with 20 fig lipopolysaccharide (S. 
typhimurhm. Difco, Detroit, Michigan) at two-week intervals 
for a total of three injections. Serum was collected 

25 every two weeks during weeks 7 through 14 for analysis by 
ELISA and Western blot and for determination of 
neutralization titer. Mouse antibodies to native gplB 
have been described in Example 2 above. 

30 Affinity adsorption of BDV-specif ic serum- antibodies : 

Antibodies that bound to recp23 and recp40 were 
sequentially removed from serum of an infected rat 
according to Crabb et. al. {Crabb, B. S., et al . , Virology, 
190:143-154 (1992)}. Serum (D2) from an adult -infected 
35 Lewis rat (15 weeks post intranasal infection), was 
diluted 1:10 in TBS (tris balanced saline, 50 mM Tris pH 
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7.4 and 100 mM NACl) and incubated overnight at 4°C with 
membrane -bound recp23 . The anti-recp23 antibody-depleted 
serum (D2 Accrecp23) was removed, the membrane was washed 
with TBS and adsorbed anti-recp23 antibodies were eluted 
5 (recp23 eluant) by incubation with 1 ml of 0.1 M glycine, 
0.15 M NaCl pH 2.7 for 3 minutes. The pH of the eluant 
was adjusted by addition of 300 fil of 10 mM Tris HCl pH 
7.5. The anti-recp23 antibody-depleted serum was then 
incubated with membrane -bound recp40 (D2 Aocrecp23, 

10 Aocrecp40) and purified as before (recp40 eluant) . 
Antibody depletion from serum and antibody elution from 
membrane -bound proteins was monitored by Western blot and 
ELISA. At each step during the purification, antibody- 
depleted sera and eluted antibodies were analyzed for 

15 neutralizing activity. Antibodies to gpl8 or recpl8 were 
also adsorbed (D2 Aocgpl8, D2 Accrecpl8) and eluted (gpl8 
eluant, recpie eluant) by this method. These adsorption 
and elution experiments were repeated using serum (B3) 
from an additional adult -infected rat (15 week post 

20 intranasal infection) . 

IP of BDV particles or sub particles and analysis by 
reverse transcription polymerase chain reaction (PCR) : 

Forty- thousand ffu of BDV in a volume of 200 (il 
25 were treated with 50 /xg/ml of DNase I and RNase A 
(Boehringer Mannheim Corp., Indianapolis, Indiana) for 30 
minutes at 37°C then incubated for 2 hours at room 
temperature with 100 pi of one of the following: (1) 
serum from acutely or chronically infected rats at 1:10 
30 dilution in PBS; (2) purified serum- antibodies at 1:10 
dilution; (3) mouse anti-gp!8 sera or rat anti-recpl8 
sera at 1:20 dilution; or (4) monoclonal antibodies 
against gpl8 at 1:5 dilution. Next, 100 pi of 1 mg/ml 
Protein A-Sepharose (Pharmacia, Puscataway, New Jersey) 
35 in PBS was added, and the mixture was incubated overnight 
at 4°C. The Protein A-Sepharose-antibody- virus complex 
was washed three times in PBS then resuspended in 100 pi 
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10 



water. Total RNA was extracted {Chomczynski, P., et al., 
Anal. Biochem. , 162: 156-159 (1987)} and used for RT-PCR 
amplification of a 693 nucleotide region of the viral 
genome (nucleotide 753 to 1446) according to Schneider et 
al. (primer 7 and primer 9) {Schneider, P. A., et al., J. 
Virol, 68:63-68 (1994)}. PCR products were analyzed by 
agarose gel electrophoresis. PCR products were cloned 
and sequenced to confirm that they represented the 
predicted region of the genomic RNA (Schneider, P. A., et 
al., J- Virol., 68:63-68 (1994)}. Negative controls for RT- 
PC r' included the omission of virus from 
immunoprecipitation reactions and the use of genomic 
sense primers during first strand cDNA synthesis. 



15 



20 



RESULTS 

The following figures present part of the results: 
FIG. 14. Timecourse for the appearance of 
antibodies to BDV proteins in sera from individual rats 
after i.n. infection. (A) Neutralization activity in 
sera from BDV- infected rats at three timepoints (5, 10 
and 15 weeks post -infection) . Each serum is represented 
by a circle. Bars indicate mean neutralization titer for 
each group (5, 10 or 15 weeks post - infection) . Asterisk 
represents sera with neutralization titer less than or 
25 equal to 1:16. (B) Plot of mean recplS ELISA titers 
(open columns) with neutralization titers (hatched 
columns) at three time points (5, 10 and 15 weeks post- 
infection) . Sera analyzed were the same as those xn 
panel A Mean values for neutralization activity were 
30 determined as described in FIG. 14A. Arrows indicate 
threshold for significance in neutralization assay (1:32) 
and recplB ELISA (1:250). These values were selected 
because normal rat sera reacted in the neutralization 
assay and reccplS ELISA at titers of 1:16 and 1:125, 
35 respectively. (C) Timecourse for the appearance of 
antibodies to recp40, recp23, and gpl8 by Western blot 
analysis. Proteins were size-fractionated by SDS-PAGE 
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and transferred to nitrocellulose membranes. Membranes 
were incubated first with sera and then with horseradish 
peroxidase-coupled goat anti-rat IgG. Bound secondary 
antibody was detected by chemiluminescence. Results 
5 shown are from serum of one representative animal at 
several different timepoints post BDV infection (p.i.). 

FIG* 15. Monoclonal antibody (MAb) detection of 
gpl8. A) Immunoprecipitation of gpl8 with MAbs. gpl8 
was first incubated with MAbs or sera from infected or 

10 noninfected rats, then precipitated with Protein- A 
Sepharose, size- fractionated by 12% SDS-PAGE and 
transferred to Imraobilon-N membranes- Precipitated gpl8 
was visualized with rat anti-recp!8 sera, goat anti-rat 
IgG-AP, and Western Blue. Lanes 1, serum from infected 

15 rat (15 week p.i.) ; 2, serum from noninfected rat; 3, MAb 
14/29A5; 4, MAb 14/26B9; 5, MAb 14/8E1; 6, MAb 14/13E10; 
7, MAb 14/18H7; 8 r MAb 24/36F1 (MAb directed against the 
BDV 23 kDa protein, negative control); 9, no antibody. 
Arrow indicates gpl8; H and L represent heavy and light 

20 chains of immunoglobulin, respectively. B) MAbs were 
analyzed for binding to native gp!8 in Western blot. 
gpl8 was separated on 12% SDS-PAGE and transferred to an 
Immobilon-N membrane. Strips were incubated with MAbs or 
sera from infected or noninfected rats. Bound antibodies 

25 were detected with alkaline phosphatase conjugated goat 
anti-rat IgG or goat anti-mouse Fab-specific and Western 
Blue substrate. Lanes: 1, serum from infected rat (15 
week p.i., D2) ; 2, serum from noninfected rat; 3, MAb 
14/29A5; 4, MAb 14/26B9; 5, MAb 14/8E1; 6, MAb 14/13E10; 

30 7, MAb 14/18H7; and 8, MAb 24/36F1 (MAb directed against 
the BDV 23 kDa protein, negative control). Molecular 
weight markers (10 3 Da) are shown at the right. 

FIG. 16. Neutralization profile of sera and MAbs. 
BDV (50 ffu) was preincubated with serial dilutions of 

35 serum or MAb and then added to ten thousand rabbit fetal 
glial cells. After four days of incubation, the infected 
cells were visualized as described in Pauli et al. 
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{Pauli, G. , et al., ZbL Vet.-Med [BJ 31:552-557 (1984)}. The 
number of infected cell-foci per well was counted. (A) 
Serum from noninfected rat. (B) serum from infected rat 
(15 week p.i.. D2). (O MAb 14/13E10. (D) MAb 14/29A5. 
5 FIG. 17- Precipitation of BDV using sera from 

infected rats, monospecific rat antisera to recplB and 
monoclonal antibodies (MAbs) to gplB. Virus was treated 
with nucleases to eliminate nucleic acid not contained 
within virions then immunoprecipitated with sera or MAbs 
10 and Protein A-Sepharose. FNA was extracted and subjected 
to RT-PCR to amplify a 693 nucleotide viral genomic 
sequence. PCR-products were visualized in an ethidium 
bromide-stained 1% agarose gel. (A) Precipitation of BDV 
with sera from infected rats. Lanes: 1. serum from 
15 infected rat, 15 week p.i.; 2, serum from infected rat, 
5 week p.i.; 3, serum from infected rat. 15 week p.i., no 
BDV; 4, serum from infected rat, 15 week p.i., genome 
sense primer used for first strand cDNA synthesis. (B) 
Precipitation of BDV by monospecific antisera to recplS 
20 and MAbs to gpl8. Lanes: 1, monospecific rat antisera to 
recplS; 2, MAb 14/13E10; 3, MAb 14/29A5. DNA markers 
(basepairs) are shown at the right. 

Timecourse of disease and appearance of antibodies to BDV 

25 in infected rates t 

Rats developed Boma disease (BD) within 5 weeks 
post infection. The acute phase of the disease, 4-8 
weeks post infection, was associated with marked weight 
loss, disheveled fur. dystonic posture, hind limb paresis 

30 and paralysis, mortality of 35%. and prominent 
inflammatory cell infiltrates in the brain. In the 
chronic phase of disease. 10-15 weeks post -infect ion. 
signs of disease stabilized and inflammation receded. 
Virus titers in the brains of animals acutely (5 weeks 

35 pi) and chronically infected (15 weeks p.i.) were 
2 4±0.4xl0 5 ffu/ml and 4.4±0.2xl0« ffu/ml. respectively. 
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Sera were monitored for virus neutralization 
activity (FIG. 14 A, B and C) and the presence of 
antibodies reactive with recp40, recp23, recpl8 or native 
gpl8 in Western blot (FIG. 14C) and ELISA. 
5 Neutralization activity was first detected in sera (28% 
of the animals) at 5 weeks p.i. By week 15 p.i., all 
sera had neutralization activity with a mean titer of 
1:977±246. Antibodies to recpl8 were first detected by 
ELISA at week 5 p.i. and showed a marked increase in 

10 titer by 15 weeks p.i. (1 :4, 610±1,463) (FIG. 14B) . In 
contrast, antibodies reactive with recp40 and recp23 were 
detected by ELISA within 4 weeks of infection, reached a 
titer greater than 1:20,000 by 8 weeks p.i. and remained 
elevated through 15 weeks p.i. (see Example 3 above). 

15 Antibodies reactive with recp40 and recp23 were detected 
by Western blot between weeks four and five p.i., whereas 
antibodies to gpl8 were detectable only after week 10 
p.i. (FIG. 14C) . 

20 Affinity adsorption of neutralizing sera: 

To determine whether the presence of antibodies to 
gpl8 correlate with neutralization activity, two rat sera 
(D2 and B3, 15 weeks p.i.), were tested in the 
neutralization assay after successive depletions of 

25 antibodies to individual BDV-proteins . Antibodies to 
BDV- specific proteins were removed from D2 rat serum by 
adsorption with membrane -bound protein. The efficiency 
of antibody depletion from serum was monitored by Western 
blot and ELISA. Prior to adsorption, the titers to 

30 recp40 and recp23 were each greater than 1:20,000. 
Following adsorption with recp23, the titer to recp23 
decreased to 1:200. After adsorption with recp40, the 
titer to recp40 decreased to 1:150. Eluted antibodies 
were reactive by ELISA with the proteins used for 

35 adsorption: recp23 eluant titer, 1:5,000; recp40 eluant 
titer, 1:15,000. Serum antibodies remaining after 
adsorption, and eluted antibodies, were then tested for 
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neutralizing activity. The neutralization titer of the 
D2 serum (NT S0 1:1,000-1,500) did not change after 
adsorption with recp23 and recp40 antigens (D2 A«recp23, 
Accrecp40) (Table 4). Antibodies eluted from proteins 
5 recp40 (recp40 eluant) and recp23 (recp23 eluant) had no 
neutralization activity (Table 4) . In contrast, the NT 50 
of the D2 serum decreased from 1:1,000-1,500 to 1:600-700 
after adsorption with recplS (D2 AccrecplS) and to 1:160- 
200 after adsorption with gpl8 (D2 AccgplS) (Table 5). 
10 The neutralization titers of antibodies eluted from 
recplS (recplS eluant) and gpl8 (gpl8 eluant) were 1:60- 
100 and 1:240-400, respectively (Table 4). Similar 
results were obtained with serum from rat B3. 
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TABLE 4 Characterization of serum antibodies 



Serum 



Reciprocal 
NT 


IP-RT-PCR - 


Reciprocal 
rplSELISA 
titer 


1,000-1,500 


+ 


4300-5,000 


1,000-1,500 


+ 


4,000-5,000 


600-700 


+ 


NS* 


160-200 


+ 


800-840 


60-100 


+ 


2,400-3,000 


240-400 


+ 


1,200-2,000 


NS 




NS 


NS 




NS 


320-480 


+ 


>5,000 


160-320 


+ 


>5,000 



Chronic (15 wk p.i. [D21) 

D2 Aarecp23, AarecpW* 

D2 Aarecpl8* 

D2Aagpl8* 

recpl8 eluant* 

gpl8 eluant* 

recp23 eluant* 

recp40 eluant^ 

Ratarecpl8 

Mouseagpl8 



* TP of BDV and detection of genomic RNA by RT-FCR. 

* Chronic rat sera (D2) adsorbed with recombinant (recp23, recp40, recp!8) or 
native (gp!8) protein. 

c NS, not significant. The NT was considered to be not significant below 1:32; 
the recpl8 EUSA titer was considered to be not significant below 1250. 

d Antibodies in chronic rat sera (D2) eluted from recombinant or native 
protein. 
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Monospecific antibodies to recpl8 and gpl8: 

Sera from rats and mice immunized with recp!8 and 
gpl8, respectively, were tested for neutralization 
activity- Neutralizing antibodies in both rats and mice 
5 were detected at 12 weeks post -immunization. Sixteen - 
weeks after immunization with recpl8, 2 rats had 
neutralization titers between 1:320 and 1:480 (Table 4) ; 
sera from 2 mice immunized with gpl8 had neutralization 
titers between 1:160 and 1:320 (Table 4) • 

10 

Monoclonal antibodies to gpl8: 

MAbs were generated against gpl8. Five positive 
clones were identified by ELISA using gpl8 as antigen. 
The MAbs represented three different immunoglobulin 

15 isotypes, yet all contained the kappa light chain (Table 
5) . Although each of the monoclonal antibodies 
immunoprecipitated gp!8 (Table 5 # FIG. 5A) and recpl8 
(Table 5) , only MAb, 14/29A5 reacted by Western blot 
(Table 5, FIG. 15B) . 

20 Concentrated supernatant s from all five MAbs 

neutralized BDV infect ivity (Table 5) . Similar to sera 
from chronically- infected rats (FIG. 16B) , the 
neutralization titer of four MAbs was greatest at highest 
antibody concentration (FIG. 16C) . In contrast, one MAb, 

25 14/29A5, neutralized BDV only when used at a dilution of 
1:400-1:1,000 (FIG. 16D) . Neutralizing sera from 
chronically- infected rats, mice immunized with gp!8 or 
rats immunized with recpl8 (Table 4) had the capacity to 
inhibit 100% of BDV infectivity (FIG. 16B) . In contrast, 

30 MAbs to gpl8, used individually or in concert, inhibited 
a maximum of 68% of BDV infectivity (FIG. 16C and D) . 
Supernatants of two MABS, 14/18H7, NT 50 1:16 and 14/13E10, 
NT 50 1:32, showed cooperativity in neutralization assays; 
pooling of these MAbs resulted in a higher neutralization 

35 titer (NT 50 1:100-150). To determine the extent to which 
neutralization was complement -dependent, neutralization 
activity of MAbs was tested with addition of either 
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active or heat -inactivated mouse complement. No increase 
in neutralization titer was detected with addition of 
mouse complement. Serum from noninfected (normal) rats 
was not neutralizing at dilutions greater than is 16 (FIG. 
5 16A) . 

imaunoprecipitation of BDV with neutralizing antibodies: 

BDV stock was treated with nucleases to eliminate 
free nucleic acids then incubated with sera or MAbs and 

10 Protein A-Sepharose. RNA was extracted from 

immunoprecipitated viral particles or subparticles and 
subjected to RT-PCR for amplification of viral genomic 
RNA. Neutralizing rat sera (FIG. 17A) , monospecific sera 
to recplB (FIG. 17B) or gplB, and D2 serum antibodies 

15 eluted from recplS or gplB precipitated BDV particles. 
Removal of antibodies to recp23, recp40, recplS or gpl8 
did not affect the capacity of neutralizing sera to 
precipitate viral particles. Four MAbs also precipitated 
BDV (FIG. 17B and Table 5). One MAb, 14/29AS, did not 

20 precipitate viral particles at any dilution (1:5, 1:100, 
1-200 or 1:500). Sera from noninfected or two acutely 
infected rats (5 weeks p.i.) (FIG. 17A) did not 
precipitate BDV. Experiments with sera and monoclonal 
antibodies are summarized in Tables 4 and 5. Negative 
25 controls for RT-PCR included the omission of virus from 
immunoprecipitation (FIG. 17A) and the use of genomic 
sense primers for first strand cDNA synthesis (FIG. 17A) . 



30 



35 



DISCUSSION 

The presence (or absence) of neutralizing antibodies 
in BDV- infected animals has been controversial. Some 
reports have not shown evidence for neutralizing 
antibodies {Carbone, K. M., et al.. J. Virol, 61:3431-3440 
(1987); Herzog, S., et al., J. Gen. Virol. 66:503-8 (1985) 
and Narayan, O., et al., J. Inf. Dis.. 148:305-315 (1983)}, 
however, this may reflect different timepoints for 
collection of sera or variation in the assay system for 
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neutralization. Although there are reports of 

neutralizing antibodies in serum and CSF of both 
naturally and experimentally infected animals {Danner, 
K., et al., Zbl Vet.-Med [B], 25:345-355 (1978); Hirano, N. , 
5 et al., J. Gen Virol, 64:1521-1530 (1983); Ludwig, H., et 
al., Progr.Med Virol , 35:107-151 (1988) and Ludwig, H. , et 
al.. Arch Virol [Suppl] 7:111-133 (1993)}, neither the 
timecourse for development of neutralizing antibodies nor 
their target antigens have been characterized. Here, we 

10 show that the neutralizing activity of BDV-rat sera 
increases dramatically from the acute (5 weeks p.i.) to 
the chronic (15 weeks p.i.) phase of disease and provide 
evidence to indicate that neutralization activity is due, 
at least in part, to antibodies that react with a BDV 

15 glycoprotein, gpl8. The timecourse for the appearance of 
neutralizing antibodies seems to correlate with 
immunoreactivity to gpl8. Furthermore, removal of 
antibodies to gpl8 or recpl8 dramatically decreased the 
neutralization titer of BDV-rat sera. In contrast, 

20 subtraction of antibodies to two other viral proteins, 
p4 0 and p23, had no effect. 

Neutralization activity was detected with 
monospecific antiserum against both gpl8 and recpl8 as 
well as with monoclonal antibodies against gpl8. These 

25 MAbs represent three different isotypes, IgM, IgG2b and 
IgG3, indicating that multiple isotypes are capable of 
virus neutralization. Addition of complement did not 
enhance neutralization activity of the MAbs, suggesting 
that the mechanism for neutralization was neither 

30 complement -mediated inactivation of virus nor steric 
hindrance by a complement-MAb- virus complex. 

It is likely that at least three different antibody 
binding sites on gpl8 were involved in neutralization. 
Four MAbs, which immunoprecipitated both gpl8 and recpl8 

35 but did not detect protein in Western blots, presumably 
bound to discontinuous epitopes. The observation that 
use of MAbs 14/13E10 and 14/18H7 in combination, resulted 
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in greater neutralization activity than use of either MAb 
alone, suggests that these MAbs recognized either 
different discontinuous epitopes or different binding 
sites on a single discontinuous epitope. One MAb, 
14/29A5, detected protein in Western blots as well as 
immunoprecipitation assays indicating that it bound to a 
continuous epitope. Unlike the other MAbs, 14/29AS 
neutralized infectivity only after dilution (FIG. 16D) , 
a profile consistent with neutralization by virus 
aggregation as reported in other viral systems {Dimmock, 
N. J. , A. Capron, et al. (ed.), "Current Topics in 
Microbiology and Immunology", Springer-Verlag, Berlin 
(1993) and Outlaw, M. C, et al.. J. Gen. Virol, 71:69-76 
(1990)}. Although all of the gpl8 MAbs detected recplB 
(nonglycosylated protein) , it is possible that there are 
additional epitopes for neutralization which include the 
carbohydrate portion of gplB. 

Sera from chronically- infected rats had greater 
neutralization activity than monospecific sera or 
monoclonal antibodies directed against gplS . Higher 
neutralization activity in sera from infected animals 
could reflect factors that influence epitope presentation 
such as interactions between gplB and other proteins or 
the virion envelope. Alternatively, gplS may not be the 
only BDV protein that elicits neutralizing antibodies. 
Sera from chronically- infected animals retained partial 
neutralizing activity and the capacity to precipitate 
virus after adsorption with gplB. Although this may be 
due to incomplete subtraction of antibodies to gplB 
(Table 4) neutralizing antibodies may be directed against 
other viral proteins as well. For example, an additional 
candidate for a virion surface protein that may elicit 
neutralizing antibodies is P 57. This putative protein 
contains multiple potential N-glycosylation sites and, as 
the product of the fourth ORF on the BDV genome, is in 
the gene position generally occupied by glycoproteins in 
Mononegavircdes {Briese, T. , et al., Proc. Natl Acad. Sci. USA : 
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91:4362-4366 (1994)}, It is contemplated that passive 
administration of neutralizing antibodies or immunization 
with gpl8 and other virion surface proteins can alter BDV 
pathogenesis. 

5 

EXAMPLE S 

Fragments of Borna Disease Virus Proteins ImffiBgasfcivS 
With Sera From Human Schizophrenics and BDV Infected 
Animfrlp 

10 The etiology of schizophrenia, a debilitating 

disease that affects approximately 1% of the world's 
population, is unknown. Higher prevalence in some 
geographic areas, seasonal variation in births of 
subjects who develop disease, increased risk of 

15 schizophrenia in subjects exposed to influenza virus 
during the second trimester in utero and discordance for 
disease in monozygotic twins suggest the possibility of 
an infectious basis {Kirch, D. G. , Schizophrenia Bulletin, 
19:355-370 (1993)}. Borna disease virus has been 

20 implicated in human affective disorders by studies 
reporting that patients have serum antibodies to BDV 
{Rott, R., etal., Science, 228:755-756 (1985)} and the 
presence of viral proteins and nucleic acids in 
peripheral blood mononuclear cells {Bode, L. , etal., Nature 

25 Medicine, 1:232-236 (1995)}. The catecholamine-related 
stereotypic behaviors observed in BDV- infected rats and 
catecholamine system dysfunction present in schizophrenia 
prompted Western blot (WB) studies of sera from 
schizophrenics for antibodies to BDV obtained from 

30 animals infected with the virus {Waltrip, R. w. , .II, etal. , 
Psychiatry Res. , 56:33-44 (1995)}. The present invention 
discloses an ELISA test for schizophrenia and BDV 
infection; fragments and peptides derived from p23 and 
gpl8 which are immunoreactive with sera from 

35 schizophrenics and animals infected with BDV and/or 
immunized with p23 and gpl8. The test is specific, 
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sensitive, fast, easy, and economical. For example, 
indirect immunofluorescence assays (IFT) used in the 
current art does not define the viral protein (s) 
responsible for immunoreactvity . IFT, 
5 immunoprecipitation (IP) , and WB are also less sensitive 
than the ELISA of the present invention and require at 
least 2 days for completion and are unsuitable for 
screening of multiple samples. In contrast, the present 
ELISA provides inexpensive, rapid tests which allow 

10 analysis of hundreds of serum samples, e.g. in several 
hours, with minimal equipment. The advantages of ELISA 
over the prior art diagnostic methods are described in 
further detail in Briese, T. , etal., J. Clin. Microbiol. , 33:348- 
351 (1995) • Besides the detection of schizophrenia and 

15 BDV infections disclosed in this Example, the ELISA 
method disclosed herein is generally applicable for 
studies and detection of neurologic and neuropsychiatry 
diseases and BDV infections in men and animals. 

20 I. Timrnmoraact W lfcy of t»40 . p23 and gplB, with Sera 

Sera from 30 human schizophrenic patients were 
examined by ELISA for immunoreactivity with recombinant 
BDV proteins N (recp40) , P (recp23) and M (recplB) {The 
25 recombinant proteins were produced and the ELISA were 
performed as described in Examples 3 and 4, above, which 
were also described Briese, T., etal., J. Clin. Microbiol. , 
33:348-351 (1995)}. Controls were sera from 30 age and 
sex matched normal subjects and 30 patients with multiple 
30 sclerosis (MS), an autoimmune central nervous system 
(CNS) disease of unknown etiology. Although some sera 
detected N or P but not M, all sera immunoreactive with 
M also detected N and P. Twenty-seven percent of 
schizophrenic subjects (7) had serum antibodies to M, N 
35 and P versus 3% of normal subjects (1) or 0% of MS 
patients (p< 0.0001) (Table 6) . Immunoreactivity of sera 
with all 3 proteins was confirmed by WB using extracts 
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from BDV- infected C6 cells supplemented with gpl8; IP of 
the recombinant BDV proteins and IPT using infected 
rabbit fetal glial cells (the WB, IPT, and IP were 
conducted using the methods known in the art, as 
5 described in Briese, T., et al. , 7. Clin. Microbiol. , 33:348-351 
(1995)}), Sera not reactive with 1 or more of the 3 
proteins in ELISA were also negative in the other assays. 
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II. Selection and Immunoreactivitv of Truncated 

Fragments of p23 and op16 

The immunologic determinants on p23 and gpl8 were 
determined using truncated fragments of these proteins . 
5 The fragments used are shown in Figs. 20B and 21B. In 
FIG. 20B, the fragments are designated SI to S4 and NS, 
respectively. In FIG. 21B # the fragments are derived 
from the unglycosylated version of recpl8, and are 
denoted Ml to M4 and MS. The fragments are shown from 

10 the amino terminus (left) to the carboxyl terminus 
(right) of the proteins. The numbers below each fragment 
indicate the locations of the amino acids on the full 
length p23 or gp!8, respectively. For example, p23 has 
a total of 201 amino acids, thus, as shown in FIG. 20B, 

15 SI represents the full length p23 because it spans from 
amino acid at position 1 (denoted laa in the figure) to 
the amino acid at position 201 (denoted 20laa) of p23 . 
S2 is a protein representing a fragment of p23, spanning 
from amino acid at position 37 to position 201 of p23. 

20 Similarly, in FIG. 21B, MS is a protein representing a 
fragment of the unglycosylated gpl8, spanning from amino 
acid at position 1 to position 70 of the unglycosylated 
gpl8. These fragments were recombinantly produced by 
selecting the appropriate PCR primers for the fragments 

25 based on the cDNA of p23 and gpl8 disclosed in this 
patent application and by cloning the respective cDNA 
fragments into the prokaryotic expression vector pET15b 
(Novagen) using techniques known in the art such as 
described in Example 3, above. 

30 To test their immunoreactivity, these truncated 

fragments of p23 and gpl8 proteins were used in ELISA 
with sera from 6 BDV infected rats (15 weeks post 
infection) and 7 iramunoreactive schizophrenic patients 
(of Section I above) . Horse sera were only from acutely 

35 infected animals, at a stage of disease where antibodies 
to gpl8 are not present, thus, sera from 4 BDV infected 
horses were used only to study p23. For truncated 
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fragments of P 23, similar patterns of immunoreactivity 
were found for sera from BDV infected rats, BDV infected 
horses and schizophrenic patients. In the case of 
truncated fragments of unglycosylated gpl8 proteins, 
instead of using sera from BDV infected horses, the sera 
from 2 mice immunized with native gpl8 were tested (15 
weeks post immunization) . The immunized mice were used 
to determine whether the truncated fragments derived from 
unglycosylated gplB were specifically immunoreactive with 
antibodies raised against native gplB. Again, similar 
patterns of immunoreactivity were found for sera from the 
BDV infected rats, mice immunized with native gpl8, and 
schizophrenic patients. The above results are shown in 
Fig 20A and 21A. respectively, the taller blocks in the 
histograms indicate increased immunoreactivity relative 
to the shorter blocks. Significantly, the above 
truncated fragments did not immunoreact with sera from 
the same controls (and the same number of controls) used 
in Section I above, whereas some sera from human controls 
with no neuropsychiatric disease immunoreacted with the 
full length p23 and gplB proteins (see Table 6) . Thus, 
these fragments are more specific for detecting 
neuropsychiatric disease than the full length proteins. 

ui. flp^n pa MapTT?"" °* peptides Derived tvm p23 and 
op18 

Fine-mapping of epitopes with overlapping peptides 
of P 23 and gplB also revealed that the same determinants 
were detected by the above sera. To determine where an 

i epitope was within each protein, a series of overlapping 
peptides were chemically synthesized and each peptide was 
tested for its ability to bind the antibody from 
schizophrenics and the sera of animals infected with BDV 
and/or immunized with p23 and gplB. 

s As shown in Figs. 22 and 23, peptides of 8-mers were 

chemically synthesized, starting from the amino terminus 
of p23 and gp!8 and spanning the full length of the 
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proteins. Except for the peptides at the amino and 
carboxyl termini of p23 and gplB, each of the intervening 
peptide overlaps its neighboring peptides (at its amino 
and carboxyl ends, respectively) by 4 amino acids. 
5 To map the immunoepi topes on p23, the above 8-mer 

peptides derived from recp23 were tested against sera 
from: 6 rats infected with BDV (15 weeks post infection, 
p. I.); 2 rabbits immunized with recp23 (15 weeks post 
immunization) ; and 7 immunoreactive schizophrenic 

10 patients (of Section I above) . To map the immunoepitopes 
on gp!8, the above 8-mer peptides derived from 
unglycosylated recpl8 were similarly tested, except that 
the rabbit sera were replaced with sera from 2 mice 
immunized with native gpl8 (15 weeks post immunization) . 

15 The immunized mice were used to determine whether the 
series of overlapping 8-mer peptides derived from 
unglycosylated gpl8 were specifically immunoreactive with 
antibodies raised against native gpl8. The controls in 
both tests were the same as in Section I above. 

20 The tests were conducted using SPOTs membrane 

(Genosys Biotechnologies, Inc., The Woodlands, TX, USA) 
and the technique described in Frank, R. , etal., Tetrahedron, 
44:6031-6040 (1988); Blankenmeyer-Menge, B., et al. , 
Tetrahedron Letters, 29 (46) : 5871-5874 (1988) ; Blankenmeyer- 

25 Menge, B., etal. t in "Innovation and Perspectives in Solid- 
Phase Synthesis", (Epton, R. ed.), Chapman and Hall publ. 
(1989); and Blankenmeyer-Menge, B., etal. , Tetrahedron Letters , 
32(12) :1701-1704 (1990) (the tests are hereinafter 
described as the "SPOTs tests"). The results are shown 

30 in Figs. 22 and 23. The blocks in the histogram in each 
figure indicate the peptides which immunoreact with the 
sera, the taller block indicates increased 
immunoreact ivity relative to the shorter blocks. Based 
on the immunoreactivity, the sequences of the epitopes 

35 were deduced. The amino acid sequences of the epitopes 
and thus the peptides are as shown in Tables 7 and 8, 
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below, wherein the peptide sequences are shown from the 
amino terminus (left) to the carboxyl terminus (right) : 



TABLE 7 



5 



10 



15 



20 



Peptides derived from p23 








MATRPSSL 


SEQ 


ID 


NO. 


20 


NALTQPVDQLLK 


SEQ 


ID 


No. 


21 


DQPTGREQ 


SEQ 


ID 


No. 


22 


VRGTLGDI 


SEQ 


ID 


No. 


23 


TAQRCDHS 


SEQ 


ID 


No. 


24 


METMKLMMEKVD 


SEQ 


ID 


No. 


25 


PMLPSHPA 


SEQ 


ID 


No. 


26 


TADEWDII 


SEQ 


ID 


No. 


27 


TABLE 8 










Peptides derived from gpl8 








MNSKHSYV 


SEQ 


ID 


No. 


28 


TLMLEIDF 


SEQ 


ID 


No. 


29 


GHSLVNIYFQID 


SEQ 


ID 


No. 


30 


YKDPIRKY 


SEQ 


ID 


No. 


31 


AFNVFSYR 


SEQ 


ID 


No. 


32 



The result of representative SPOTs tests with the 8- 
25 mer peptides derived from unglycosylated gpl8 are 
graphically shown in FIG. 24A, the panels contained sera 
from: 1 mouse immunized with native gpl8, 1 rat infected 
with BDV, and 1 schizophrenic human, respectively. Each 
spot on the panels indicates the immunoreation of a serum 
sample with an 8 -mer unglycosylated gpl8 peptide. As 
shown in the scale on Fig 24B, the darker the spots, the 
higher the immunoreactivity. The lightest spot (Scale 1) 
indicates no detectable immunoreactivity; and the darkest 
spot (Scale 4) indicates highest immunoreactivity. As 
shown in FIG. 24A, the immunoreactivity pattern of the 
sera against the peptides were similar for all the 
animals/humans tested. -Based on the pattern of 
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immunoreactivity as shown by the spots, the epitopes El 
to E5 were mapped. The result of the epitope mapping is 
graphically shown in FIG. 24B, the height of the blocks 
is directly proportional to the degree of 
5 immunoreactivity of the peptides tested which span the 
full length of gpl8, from amino acid at position 1 to 
position 142. The sequences of the mapped epitopes, El 
to E5, are listed below the histogram of FIG. 24B. The 
epitopes mapped are the same as in Table 8, above, and 

10 confirmed that the sera were specifically immunoreactive 
with epitopes found within gpl8. Again, significantly, 
the control sera did not immunoreact with the peptides. 

The same test was applied to the overlapping 8-mer 
peptides derived from p23, except that the mice were 

15 immunized with recp23. A similar result was obtained, Le. 
the immunoreactivity pattern of the sera against the 
peptides were similar for all the animals /humans tested, 
and the test produced the epitopes shown in Table 7, 
above. Again, significantly, the control sera did not 

20 immunoreact with the peptides. 

In summary, the above truncated fragments, epitopes 
and peptides, and nucleotide sequences which encode them 
or which are complementary to these encoding nucleotide 
sequences, can be used to: (1) diagnose, prognose, 

25 monitor, and manage BDV infection/disease and 
schizophrenia, and more generally neurologic and 
neuropsychiatric diseases; and (2) vaccinate an animal or 
human against the foregoing infection and diseases. 
Other useful truncated immunoreactive fragments, epitopes 

30 and peptides can be similarly derived from the other BDV 
proteins using the method of this Example. Thus, the 
nucleotide sequences encoding these truncated fragments, 
epitopes and peptides, nucleotide sequences complementary 
to the foregoing, and recombinant vectors and cells 

35 expressing the truncated fragments, epitopes and 
peptides, and their uses are also claimed here. The 
vaccines, diagnostic, prognostic and monitoring methods, 
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recombinant vectors and cells, and nucleotide sequences, 
can be made using the teaching contained in this patent 
application in combination with methods known in the art. 
The above findings also suggest an association between 
5 BDV infection and schizophrenia. 

Identification and Characterization of the BDV O- 
Protein 

10 Although the BDV antigenome contains five major 

ORFs, products are reported only for the first three ORFs 
on the antigenome: N ( P 40) , P ( P 24/p23) and M (gplB) . 
The fourth ORF predicts a protein (G-protein) of 57 kDa 
that contains potential N-glycosylation sites. We have 

15 used a Semliki forest virus (SFV) vector to express the 
fourth ORF in BHK-21 cultured cells. The expressed 
protein migrated at 94 kDa in a 10% SDS-PAGE analysis. 
A 94 kDa BDV-specific protein was also identified in 
infected C6 cells by immunoprecipitation with sera from 

20 infected rats. The expressed protein was markedly 
sensitive to tunicamycin, endoglycos idase F/N- 
glycosidase and endoglycos idase H, indicating that the 
protein is an N- linked glycoprotein, largely comprised of 
high mannose- and/or hybrid-type oligosaccharides. 

25 SFVp57 transfected cells showed surface expression of the 
protein and formed syncitia. The protein's presence on 
the surface of transfected cells supports the hypothesis 
that the G-protein may be a virion surface attachment 
protein. 

30 The fourth ORF in BDV predicts a protein of 57 kDa 

with N- and O- glycosylation sites and hydrophobic 
domains. Our findings show that in fact the fourth ORF 
encodes a BDV G-protein of approximately 94 kDa with 
multiple N-glycosylation and O-glycosylation sites and 

35 hydrophobic domains at the amino and carboxyl termini 
reminiscient of the signal sequence and transmembrane 
domains found in rhabdovirus G-proteins (FIG. 25) . FIG. 
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25 shows the predicted amino acid sequence of the BDV 
G-protein (this BDV G-protein is also referred to as p57 
in this patent application , and the amino acid sequence 
is listed as SEQ ID No. 8, above) . Boxed regions 
5 represent the putative endoplasmic reticulum ("ER" ) 
signal peptide sequence (amino acids 7 to 20) and 
transmembrane domain (amino acids 468 to 4 88) . Bold 
underlined sequences represent potential N-glycosylation 
sites. We now report that this ORF directs the 
10 expression of a 94-kDa N- linked glycoprotein. 

Expression of a 94 kDa protein from the BDV p57 ORF 
using an SFV expression vector* The p57 ORF (nt. 2229 to 
nt. 3744; Strain v) {Briese, T. , etai, Proc. Natl. Acad. Set 

15 USA, 91:4362-4366 (1994)} was amplified by RT-PCR from 
BDV- infected C6 cell RNA by using primers 
5 ' - CGCAATCAATGCAGC (SEQ ID NO 34) and 5' -TTCCTGCCACCGGCCG 
(SEQ ID NO 35) and cloned into the Smal restriction site 
in vector pSFV-1 (GibcoBRL, Life Technologies, Inc., 

20 Grand Island, New York) {Marchuk, D., etal. , Nucleic Acids Res. , 
19:1154 (1990)} to create pSFVp57. Capped SFV genomic 
RNA encoding p57 or 0-galactosidase (control) , were 
transcribed from pSFVJ?57 or pSFV3LacZ template, 
respectively. After tranfection into BHK-21 cells, the 

25 over expressed proteins were analyzed by 
immunohistochemical and biochemical assays {Kriegler, M., 
In Gene transfer and expression: a laboratory manual, p. 219-224, 
Stockton Press, New York (1990)}. The SFVp57 cells 
formed large multinucleated syncytia and expressed a 

30 surface protein detected by sera from BDV infected rats 
("BD-rat sera n or "BDSe") but not normal rat sera (NLSe) . 
SFVLacZ cells did not form syncytia, or express proteins 
reactive with either BDSe or NLSe . The syncytia 
formation and surface protein expression observed in 

35 cells transfected with SFV-p57 were similar to those 
described in cells transfected with SFV vectors 
containing other glycoproteins {Gallaher, W. R. , et al. , /. 
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Virol., 14:813-820 (1974); Paul, N. L. . et al. , AIDS Res. and 
Hum. Retroviruses, 9 : 963-970 (1993) } . 

Lysates of metabolically- labeled SFVp57 and SFVLacZ 
cells were used for immunoprecipitation (IP) experiments 
5 with BDSe and NLSe. Approximately 2 X 10* transfected 
cells (16 hrs after electroporation) were incubated for 
two hours in 1 ml of methionine-minus Modified Eagles 
Medium (GibcoBRL. Life Technologies, Inc., Grand Island, 
New York). Thereafter, 0.2mCi of 35 [S]Met-Cys-protein 

10 label mix (New England Nuclear, Boston, Massachusetts) 
was added for eight hours to radiolabel newly synthesized 
proteins. Cell lysates were subjected to IP according to 
Yamashita et al. {Yamashita, Y. , et al. , J. Virol. , 68:7933-7943 
(1994)} except for the modification that protein G- 

15 sepharose (Sigma) was substituted for protein A. After 
SDS-PAGE and autoradiography, a 94 kDa protein was 
detected in lysates of SFVp57 cells but not in lysates of 
SFVLacZ cells. 



20 



Identification of a BDV-speci£ic 94 kDa protein in 
BDV- infected C6 cells. Lysates of BDV- infected C6 cells 
(C6BDV) and non- infected C6 cells were metabolically- 
labeled and IP with BDSe or NLSe for analysis by SDS-PAGE 
and autoradiography (see above). At least six BDV- 
25 specific proteins were detected in lysates from infected 
cells by BDSe that were not detected in lysates of 
infected cells by NLSe or in lysates of noninf ected cells 
by BDSe. These included proteins of 200 kDa (pol) , 94 
kDa (G-protein) , 40 kDa (N protein) , 36 kDa, 33 kDa and 
30 23 kDa (P protein). Whether the 36 kDa or 33 kDa 
proteins are of viral or host origin is unknown. 

Characterization of the BDV 94 kDa protein. 

Metabolically-labeled SFV-p57 and SFVLacZ cells were 
35 treated with 10 ng/ml tunicamycin to inhibit N-linked 
glycosylation. Cell lysates were used in IP experiments 
with BDSe or NLSe prior to SDS-PAGE and autoradiography. 
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BDSe immunoprecipitated proteins of 94 kDa and 64 kDa 
from SFVp57 cell lysates. The 94 kDa protein was 
detected in untreated cells but not in tunicamycin- 
treated cells. Conversely, the 64 kDa protein was 
.5 detected in treated cells but not in untreated cells. 
Neither protein was detected in SFVLacZ cells. 

Radiolabeled proteins immunoprecipitated by BDSe 
were eluted from the sepharose beads by incubation in 
sixty microliters of 50 mMTris-HCl, pH 6.8, 0.4% SDS, 

10 0.1M 2-mercaptoethanol at 95°C for 10 min and digested 
with endoglycosidase H (Endo H) (Boehringer Mannheim) ; 
endoglycosidase F and N-glycosidase F (Endo F/PNGase 
F) (J. Elder) ; Endo H, neuraminidase (Boehringer Mannheim) 
and O-glycosidase (Boehringer Mannheim) ; or neuraminidase 

15 and O-glycosidase. Methods for carbohydrate digestion 
followed protocols of the manufacturer (Endo H, 
neuraminidase, O-glycosidase) or Alexander and Elder 
(Endo F/PNGase F) {Alexander, S., etal., Meth. EnzymoL, 
179:505-518 (1989)}; using Endo H, 2.0 mU; Endo F/PNGase 

20 F, 25 mU; O-glycosidase, 0.8 mU; neuraminidase, 1.0 mU in 
40 microliter reactions. Controls for these reactions 
included incubation of proteins with digestion buffer 
alone. Radiolabeled p57 was prepared by in vitro 
translation in rabbit reticulocytes in the absence of 

25 microsomal membranes to provide a nonglycosylated G- 
protein standard {Lipkin, W. I., etal., Proc. Natl Acad. Sci. 
USA, 87:4184-4188 (1990)}. Proteins were subjected to 
10% SDS -PAGE and analyzed by autoradiography. Treatment 
with endo H or endo F/PNGase F resulted in an apparent 

30 shift in approximately mw from 94 kDa to 64 kDa, the 
position of radiolabeled nonglycosylated G-protein. 
Treatment with neuraminidase and O-glycosidase did not 
allow resolution of a shift. Therefore, to enhance the 
sensitivity of SDS-PAGE for identifying small shifts in 

35 mw following O-glycosidase digestion, protein was first 
incubated with endo H. Comparison of protein digested 
with endo H alone with protein digested with endo H, 
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neuraminidase and O-glycosidase revealed a subtle shift 
of less than 1 kDa. 

Antibodies to the BDV 94 kDa protein in BDV-inf ected 
5 rats. Viral glycoproteins tend to be immunoreactive and 
are often targets for neutralizing antibodies. Previous 
studies of the BDV M-protein revealed epitopes that bind 
neutralizing antibodies {Hatalski, C. G-, etal., J. Virol, 
69:741-747 (1995)}. Because adsorption experiments using 
10 purified M-protein did not completely abrogate 
neutralization activity in chronic BDSe, it was proposed 
that additional neutralization epitopes might be present 
on the putative G-protein {Hatalski, C. G., etal., J. Virol. , 
69:741-747 (1995)}. To address the possibility that 
15 serum antibodies to G-protein might be present at higher 
titer in chronic BD-rat, sera from BD-rats sacrificed at 
different phases of disease were used to IP proteins from 
metabolically- labeled SFVp57 cells for analysis by SDS- 
PAGE and autoradiography. The 94 kDa protein signal was 
20 approximately 10-fold higher after IP with sera collected 
from animals 3 months post infection (chronic phase) than 
after IP with sera collected from animals 1 month post 
infection (acute phase) . 

This study was initiated to identify and 
25 characterize the product of the fourth ORF on the BDV 
antigenome. Expression of this ORF in SFVp57 cells 
yielded a 94 kDa glycoprotein. Inhibition of 

carbohydrate conjugation by tunicamycin and endo F/PNGase 
F sensitivity indicate a primary role for N-linkage. 
30 Sensitivity to endo H suggests that the N-linked 
carbohydrate is largely comprised of high mannose- and/or 
hybrid-type oligosaccharides {Tarentino, A. L., etal., 
Methods Enzymol. , 50:574-580 (1979)}. The subtle shift in 
apparent mw after digestion with neuraminidase and 0- 
35 glycosidase may represent the presence of 0-linked 
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carbohydrate {Hawkins, L. K., et at., Virol, 210:335-344 
(1995) }. 

Although there are no direct data to indicate a 
function for BDV G-protein, several observations suggest 
5 that it could play a role in viral attachment and/or 
penetration. First, the BDV G-protein contains a 
carboxyl transmembrane domain and localizes to the plasma 
membrane in pSFV57 cells. These features are 

reminiscient of other enveloped viral systems where G- 

10 proteins mediate early events in infection {White, J., et 
al, Q. Rev. Biol Phys. , 16:151-195 (1983)}. Second, treatment 
of BDV particle preparations with N-acetylglucosaminidase 
and mannosidase reduced infectivity suggesting the 
importance of N- acetyl glucosamine and mannose residues in 

15 terminal positions {Stoyloff, R. , et al. , Arch. Virol, 
137:405-409 (1994)}. Although the BDV M-protein may 
serve as a viral attachment protein, it does not contain 
terminal mannose residues {Kliche, S., et al , J. Virol, 
68:6918-6923 (1994)}. In contrast, the BDV G-protein 

20 does include terminal mannose residues and could 
therefore represent the sensitive virion surface 
component identified in particle infectivity experiments. 
Third, the titer of antibodies to G-protein in infected 
rats increased dramatically in the chronic phase of 

25 infection, consistent with the timecourse for appearance 
of neutralizing antibodies {Hatalski, C. G., etal., J. Virol, 
69:741-747 (1995)}. 

All publications and patent applications mentioned 
30 in this Specification are herein incorporated by 
reference to the same extent as if each of them had been 
individually indicated to be incorporated by reference. 

Although the foregoing invention has been described 
in some detail by way of illustration and example for 
35 purposes of clarity and understanding, it will be obvious 
that various modifications and changes which are within 
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the skill of those skilled in the art are considered to 
fall within the scope of the appended claims. Future 
technological advancements which allows for obvious 
changes in the basic invention herein are also within the 
5 claims • 

Deposit 

The cDNA of BDV genomic RNA sequence has been 
deposited in the GenBank data base (accession no. 

10 U04608) . This GenBank sequence is hereby incorporated by 
reference in its entirety. 

The recombinant transfer vector, suitable for 
transformation into Escherichia coU DH10, containing four 
overlapping cDNA libraries (as described in Example 1, 

15 above) representing the entire BDV viral genome has been 
deposited under the Budapest Treaty, at the American Type 
Culture Collection, Rockville, MD 20852 (U.S.A.) on 
December 30, 1994 under the deposit name BDVD04608, and 
ATCC Accession No. 97008. 

20 Availability of the deposited recombinant tranfer 

vector is not to be construed as a license to practice 
the invention in contravention of the rights granted 
under the authority of any government in accordance with 

its patent laws. 

25 Also, the present invention is not to be considered 

limited in scope by the deposited recombinant transfer 
vector, since the deposited vector is intended only to be 
illustrative of particular aspects of the invention. Any 
recombinant transfer vector which can be used to prepare 

30 recombinant microorganism which can function to produce 
a recombinant protein product described herein is 
considered to be within the scope of this invention. 
Further, various modifications of the invention in 
addition to those shown and described herein which are 

35 apparent to those skilled in the art from the preceding 
description are considered to fall within the scope of 
the appended claims. 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

fi) APPLICANT: The Regents of the University of California 

CM) TITLE OF INVENTION: Bo ma Disease Viral Sequences, 

Diagnostics and Therapeutics for Central Nervous 
Systen Diseases 

(iii) NUMBER OF SEQUENCES: 35 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Bobbins, Berliner i Carson 
(S) STREET: 201 H. Figueroa St., Suite 500 

(C) CITY: Los Angeles 

(D) STATE: California 

(E) COUNTRY: USA 

(F) ZIP: 90012-2628 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 
CO OPERATING SYSTEM: PC- DOS/MS -DOS 

CD) SOFTWARE: Patentln Release #1.0, Version #1.25 

Cvi> CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: 
CB) FILING DATE: 
CO CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

CA) APPLICATION NUMBER: US 08/369,622 
APPLICATION NUMBER: US 08/434,631 
APPLICATION NUMBER: Unknown 

CB) FILING DATE: 06- JAN -1995 
FILING DATE: 04 -MAY -1995 
FILING DATE: 04 -JANUARY -1996 

CO CLASSIFICATION: 

Cviii) ATTORNEY/AGENT INFORMATION: 

CA) NAME: Berliner, Robert 

CB) REGISTRATION NUMBER: 20,121 

CO REFERENCE/DOCKET NUMBER: 5555-357C2 

Cix) TELECOMMUNICATION INFORMATION: 

CA) TELEPHONE: 213/977-1001 

CB) TELEFAX: 213/977-1003 



C2) INFORMATION FOR SEO ID N0:1: 

Ci) SEQUENCE CHARACTERISTICS: 

CA) LENGTH: 1112 base pairs 

CB) TYPE: nucleic acid 
CO STRAND EDNESS: single 
CD) TOPOLOGY: linear 

Cii) MOLECULE TYPE: cONA 

Ciii) HYPOTHETICAL: NO 

Civ) ANTI-SENSE: NO 

Cxi) SEQUENCE DESCRIPTION: SEO ID N0:1: 
ATGCCACCCA AGAGACGCCT GGTTGATGAC GCCGATGCCA TGGAGGATCA A6ATCTATAT 
GAACCCCCAG CGAGCCTCCC TAAGCTCCCT GGGAAATTCC TACAATACAC CGTTGGGGGG 
TCTGACCCGC ATCCGGGTAT AGGGCATGAG AAAGACATCA GGCAGAACGC AGTGGCATTG 
TTAGACCAGT CACGGCGCGA TATGTTTCAC ACAGTAACGC CTAGCCTTGT GTTTCTATGT 



60 

120 

180 

240 
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TTGCTAATCC CAGGACT6CA COCTGCGTTT CTTCACGGAG GGOTGCCTCG TGAATCCTAC 500 

CTCTCCACGC CTGTCACGCG TGGAGAACAG ACTGTTGTTA AGACTGCGAA GTTTTACGGG 360 

GAAAAGACGA CGCAGCGTGA TCTCACCGAG CTGGAGATCT CCTCTATCTT CAGCCATTGT 420 

TGCTCATTAC TAATAGGGGT TGTGATA6GA TCGTCGTCTA AGATCAAAGC AGGACCCOAG 480 

CAGATCAAGA AAAGGTTTAA AACTATGATG GCAGCCTTAA ACCGGCCATC CCATGGTGAG 540 
ACTGCTACAC TACTCCAGAT GTTTAATCCA CATGAGGCTA TAGATTGGAT TAACGGCCAA 
CCCTGGGTA6 GCTCCTTTGT GTTGTCTCTA CTAACTACAG ACTTTGAGTC CCCAGGTAAA 
GAATTTATGG ACCAGATTAA GCTTGTCGCA AGTTATGCAC AGATGACTAC GTACACTACT 
ATAAAGGAGT ACCTCGCA6A ATGCATGGAT GCTACCCTTA CAATCCCCGT AGTTGCATAT 

GAGATCC6T6 ACTTTTTAGA AGTTTCAGCA AAGCTTAAGG AGGATCATGC TGACCTGTTC 840 

CCGTTTCTGG GGGCCATTAG ACACCCC6AC GCTATCAAGC TGGCGCCACG AAGCTTTCCC 900 

AATCTGGCCT CCGCAGCGTT TTACTGGAGT AA6AAGGAAA ACCCCACAAT GGCAGGCTAC 960 

CCGGCCTCCA CCATCCAGCC GGGCGCAAGT GTCAAGGAAA CCCAGCTTGC CCGGTATAGG 1020 

COCCGCGAGA TATCTCGTGG AGAGGACGGG GCAGAGCTCT CAGGTGAGAT CTCTGCCATA 1080 

ATGAAGATGA TAGGTGTGAC T6GTCTAAAC TA 1112 

(3) INFORMATION FOR SEO 10 N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 370 mAno eeid. 

(B) TYPE: mint •eld 

(C) STRANDEDNESS: tingle 

(D) TOPOLOGY: unknoun 

(H> MOLECULE TYPE: protein 
(ill) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 
(v) FRAGMENT TYPE: internet 
(Kl) SEQUENCE DESCRIPTION: SEO ID NO:2: 

Net Pro Pro Ly» Ar, Ar, Leu vol Asp A* Ale Mp Al. Met Glu top 
1 5 10 

Gin Asp Leu Tyr Glu Pro Pro Ale Ser Leu Pro Ly. Leu Pro Gly Lys 
20 25 *• 

Ph. Leu Gin Tyr Thr Vel Gly Gly Ser Asp Pro Hie Pro Gly lie Gly 

35 40 45 

His Glu Lys Asp He Ar, Gin Asn Al. W.l Al. L«i Leu top Gin S«- 



50 



Ar, Ar, Asp Met Phe Hi. Thr V.I Thr Pro Ser Leu V.I Phe Leu Cys 

65 70 

Leu Leu He Pro Gly Leu Hi. Al. Al. Phe V.l Hi. Gly Gly V.I Pro 
85 ™ 

Ar, Glu Ser Tyr Leu Ser Thr Pro V* Thr Ar, Gly Glu Gin Thr V.l 
100 iO> ,IW 

W.l Lys Thr Al. Ly. Phe Tyr Gly Glu Ly. Thr Thr Gin Arg Asp Leu 

115 120 
Thr Glu Leu Glu tie Ser Ser He Phe Ser His Cys Cys Ser Leu Leu 
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130 13S 140 

Ite Gly Val Vol lie Gly Ser Ser Ser Lys lie Lys Ale Gly Ale Glu 
H5 150 155 160 

Gin lie Lys Lys Arg Phe Lys Tttr Net Net Ale Ale Leu Asn Arg Pro 
165 170 175 

Ser His Gly Glu Thr Ala Thr Leu Leu Gin Net Phe Asn Pro His Glu 
160 185 190 

Ale lie Asp Trp lie Asn Gly Gin Pro Trp Val Gly Ser Phe Val Leu 
195 200 205 

Ser Leu Leu Thr Thr Asp Phe Glu Ser Pro Gly Lys Glu Phe Net Asp 
210 215 220 

Gin lie Lys Leu Val Ale Ser Tyr Ala Gin Net Thr Thr Tyr Thr Thr 
225 230 235 240 

lie Lys Glu Tyr Leu Ale Glu Cys Net Asp Ale Thr Leu Thr He Pro 
245 250 255 

Val Vel Ale Tyr Glu He Arg Asp Phe Leu Glu Val Ser Ale Lys Leu 
260 265 270 

Lys Glu Asp His Ale Asp Leu Phe Pro Phe Leu Gly Ale He Arg His 
275 280 285 

Pro Asp Ale He Lys Leu Ale Pro Arg Ser Phe Pro Asn Leu Ale Ser 
290 295 300 

Ale Ale Phe Tyr Trp Ser Lys Lys Glu Asn Pro Thr Net Ala Gly Tyr 
305 310 315 320 

Arg Ala Ser Thr He Gin Pro Gly Ale Ser Vel Lys Glu Thr Gin Leu 
325 330 335 

Ale Arg Tyr Arg Arg Arg Gtu He Ser Arg Gly Glu Asp Gly Ala Glu 
340 345 350 

Leu Ser Gly Glu He Ser Ale He Net Lys Net He Gly Val Thr Gly 
355 360 365 

Leu Asn 
370 



(4) INFORMATION FOR SE0 ID NO:3: 

(1) SEQUENCE CHARACTERISTICS: 
CA) LENGTH: 609 bese pairs 
(8) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

<*i> SEQUENCE DESCRIPTION: SE0 ID N0:3: 
ATGGCAACGC GACCATCGAG TCTGGTCGAC TCCCTGGAG6 ACGAAGAAGA TCCCCAGACA 60 
CTACGACGGG AACGACCGGG GTCACCAAGA CCACGGAAGG TCCCAAGGAA TGCATTGACC 120 
CAACCAGTAG ACCAGCTCCT GAAGGACCTC AGGAAGAACC CCTCCATGAT CTCAGACCCA 160 
GACCAGCGAA CCGGAAGGGA GCAGCTGTCG AATGATGAGC TAATCAAGAA GTTAGTGACG 240 
GAGCTGGCCG AGAATAGCAT GATCGAGGCT GAGGAGGTGC GGGGCACTCT TGGAGACATC 300 
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TCGGCTCGTA TCGAG6CAGG GTTT6AGTCC CTGTCCCCCC TCCAAGTGGA AACCATCCAG 360 
ACAGCTCAGC GGTGCGATCA CTCCGACAGC ATCAGGATCC TCGGCGACAA CATCAAGATA 420 
CTAGATC6CT CCATGAACAC AATGAT6GA6 ACAATGAAGC TCATGATGGA GAAGGTGGAT 480 
CTCCTCTACG CATCAACCGC CGTTGG6ACC TCTGCACCCA TCT76CCCTC CCATCCT6CA 540 
CCTCCCCGCA TTTATCCCCA GCTCCCAAGT GCCCCGACAA CGGATGAATG 66ACATCATA 600 
CCATAAAAA 

(5) INFORMATION FOR SEO ID N0:4: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 201 aoino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: tingle 
(0) TOPOLOGY: unknown 

(it) MOLECULE TYPE: protein 

(iii) HYPOTHETICAL: NO 

<iv) ANTI -SENSE: NO 

(xi> SEOUENCE DESCRIPTION: SEO ID NO:4: 

Met Alt Thr Arg Pro Ser Ser Lou Vol Asp Ser Leu Glu Asp Glu Glu 
! 5 10 15 

Asp Pro Gin Thr Leu Arg Arg Glu Arg Pro Gly Ser Pro Arg Pro Arg 
20 25 30 

Lys Vol Pro Arg Asn Ale Leu Thr Gin Pro Vol Asp Gin Leu Leu Lys 
35 40 45 

Asp Leu Arg Lys Asn Pro Ser Met lie Ser Asp Pro Asp Gin Arg Thr 
50 55 « 

Gly Arg Glu Gin Leu Ser Asn Asp Glu Leu lie Lys Lys Leu Val Thr 
70 75 



65 



Glu Leu Ale Glu Asn Ser Met He Glu Ala Glu Glu Val Arg Gly Thr 
85 °0 95 

Leu Gly Asp lie Ser Ala Arg lie Glu Ala Gly Phe Glu Ser Leu Ser 
too 105 110 

Ala Leu Gin Val Glu Thr lie Gin Thr Ala Gin Arg Cys Asp His Ser 
115 120 125 

Asp Ser lie Arg lie Leu Gly Glu Asn He Lys lie Leu Asp Arg Ser 
130 135 140 

Net Lys Thr Met Met Glu Thr Met Lys Leu Met Met Glu Lys Val Asp 

Leu Leu Tyr Ala Ser Thr Ala Val Gly Thr Ser Ala Pro Met Leu Pro 
165 170 »'* 

ser His Pro Ala Pro Pro Aro He Tyr Pro Bin Leu Pro Ser Ala Pro 
180 ™ 1W 

Thr Thr Asp Glu Trp Asp He lie Pro 
195 200 

(6) INFORMATION FOR SEO ID N0:5: 

Ci> SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 428 base pairs 
(B) TYPE: nucleic acid 
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(C) STRAND EDNESS: tingle 

(D) TOPOLOGY: linear 

(II) MOLECULE TYPE: cONA 
(iii) HYPOTHETICAL: NO 
(iv) ANT t -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 

ATGAATTCAA AACATTCCTA TCTGGAGCTC AA6GACAAGC TAATCGTCCC TGGATGCCCC 60 

ACACTGATGC TTGAGATAGA CTTTGTAGGG GGGACTTCAC GGAACCA6TT CCTTAACATC 120 

CCATTTCTTT CA6TGAAAGA GCCTCT6CAG CTTCCACGCG AGAAGAAGTT GACCGACTAC 180 

TTTACTATTG ACGTAGAACC ACCAGGTCAT TCCCTGGTCA ATATATACTT CCAGATTGAC 240 

GACTTCTTGC TCCTAACACT CAACTCACTA TCTGTGTACA AGGACCCGAT TAGAAAATAC 300 

ATGTTCCTAC GCCTCAACAA GGACCAGAGC AAGCACGCAA TCAATGCAGC CTTCAATGTC 360 

TTTTCTTATC GGCTTCGGAA CATTGGTGTT GGTCCTCTCG GCCCGGACAT TCGATCTTCA 420 

GGGCCTTA 428 



(7) INFORMATION FOR SEO ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 anino acids 

(B) TYPE: amino acid 
(C> ST HANDEDNESS: single 
(D) TOPOLOGY: unknoun 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:6: 

Met Asn Ser Lya His Ser Tyr Val Glu Leu Lys Asp Lys Vsl He Val 

15 10 15 

Pro Gty Trp Pro Thr Leu Net Leu Glu He Asp Phe Val Gly Gly Thr 
20 25 30 

Ser Arg Asn Gin Phe Leu Asn lie Pro Phe Leu Ser Val Lys Glu Pro 
35 40 45 

Leu Gin Leu Pro Arg Glu Lys Lys Leu Thr Asp Tyr Phe Thr lie Asp 
50 55 60 

Val Glu Pro Ala Gly His Ser Leu Val Asn He Tyr Phe Gin He Asp 
65 70 75 80 

Asp Phe Leu Leu Leu Thr Leu Asn Ser Leu Ser Val Tyr Lys Asp Pro 
85 90 95 

He Arg Lys Tyr Met Phe Leu Arg Leu Asn Lys Asp Gin Ser Lys His 
100 105 110 

Ala He Asn Ala Ala Phe Asn Val Phe Ser Tyr Arg Leu Arg Asn He 
115 120 125 

Gly Val Gly Pro Leu Gly Pro Asp He Arg Ser Ser Gly Pro 
130 135 140 



(8) INFORMATION FOR SEQ ID N0:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1515 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 
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(D) TOPOLOGY: linear 
MOLECULE TYPE: cONA 
(Hi) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:7: 

ATGCAGCCTT CAATGTCTTT TCTTATCOCC TTCGCAACAT T6GTGTTGGT CCTCTCGCCC 60 

CGGACATTCG ATCTTCAG6G CCTTACCTGC AATACTGACT CCACTCCTGG ACTOATTGAC 120 

CTGGA6ATAA GGCGACTTTG CCACACCCCA ACGGAAAATG TCATTTCATG CCAGGTTAGT 180 

TATCTCAACC ACACGACTAT TAGCCTCCCG GCA6TCCACA CATCATGCCT CAAGTACCAC 2*0 

TGCAAAACCT ATTGGGGATT CTTTGGTAGC TACAGCGCTG ACCGAATCAT AAATCGGTAC 300 

ACTGGTACTG TTAAGGGTTG TCTAAACAAC TCA6CACCAG AGGACCCCTT CGAGTGCAAC 360 

TGGTTCTACT GCTGCTCGGC GATTACAACA GAGATCTGCC GATGCTCTAT TACAAAT6TC 420 

ACGGTGGCTG TGCAAACATT CCCACCCTTC ATGTACTGCA GTTTTGCAGA CTGCAGTACC 480 

GT6AGCCAAC AGGAGCTAGA 6AGTGGAAAG GCAATGCT6A 6C6ATGGCAG TACATTAACT 540 

TATACCCC6T ATATCCTACA CTCAGAAGTC GTGAACAAAA CCCTCAATGG 6ACCATACTC 6O0 

TGCAACTCAT CCTCTAAGAT AGTTTCCTTC GATGAATTTA G6CGTTCATA CTCCCTAAC6 660 

AATGGTAGTT ACCAGAGCTC ATCAATCAAT GTGACGTGTG CAAACTACAC GTCGTCCTGC 720 

CGGCCCAGGT TGAAAAGGCG GCGTAGGGAC ACCCAGCAGA TTGAGTATCT AGTTCACAAG 780 

CTTAGGCCCA CACTGAAAGA TGCATGGGAG GACTGTGAGA TCCTCCAGTC TCT6CTCCTA 840 

CGGGTCTTTG GTACTGGGAT CGCAAGTGCT TCTCAATTTT TGAGGAGCTG GCTCAACCAC 900 

CCTGACATCA TC6G6TATAT AGTTAATGGA GTTGGG6TTG TCTGGCAATG CCATCGTGTT 960 

AATGTCACGT TCATGGCGTG GAATGAGTCC ACCTATTACC CTCCA6TA6A TTACAATGGG 1020 

CG6AA6TACT TCCTGAATGA TGAG66AACG TTACAAACAA ACA CCCCCG A GGCAACGCCA 1080 

GGGCTTAAGC GGGTCATGTG GTTCGGCAGG TACTTCCTAG 66ACASTAGG GTCTGGGGTG 1140 

AAACCGAGGA GGATTCGGTA CAATAAGACC TCACATGACT ACCACCTGGA 60AGTTTGAG 1200 

GCAAGTCTCA ACATGACCCC TCAGACCAGT ATC6CCTCGG GTCATGAOAC AOACCCCATA 1260 

AATCATGCCT ACGGAACGCA GGCTGATCTC CTTCCATACA CCAGGTCTAG TAATATAACA 1320 

TCTACGGATA CAGGCTCAGG CTGGGTGCAC ATC6GCCTAC CCTCATTTGC TTTCCTCAAT 1380 

CCCCTCGGGT GGCTCAGGGA CCTACTTGCA TGGGCAGCCT G6TTGGGTGG GGTTCTATAC 1440 

TTAATAAGTC TTTGTGTTTC CTTACCAGCC TCCTTCGCGA GGAGGAGAC6 CCTCCGCCGG 1500 
TGGCAGGAAT AAACC 

(?) INFORMATION FOR SEO ID N0:8: 

<t> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 «»fno acids 

(B) TYPE: "»ino acid 

(C) STRANDED NESS: tingle 
(0) TOPOLOGY: unknot** 

(U> H0LECULE HPE: protein 



1515 
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(xi> SEQUENCE DESCRIPTION: SEO ID N0:8: 

Net Gin Pro Ser Net Sep Phe Leu He Gly Phe Gly Thr Leu Val Leu 
15 10 15 

Val Leu Ser Ala Arg Thr Phe Asp Leu Gin Gly Leu Ser Cyi Asn Thr 
20 25 30 

Asp Ser Thr Pro Gly Leu lie Asp Leu Glu lie Arg Arg Leu Cys His 
35 40 45 

Thr Pro Thr Glu Asn Val lie Ser Cys Glu Val Ser Tyr Leu Asn His 
50 55 60 

Thr Thr He Ser Leu Pro Ala Val His Thr Ser Cys Leu Lys Tyr His 
65 70 75 80 

Cys Lys Thr Tyr Trp Gly Phe Phe Gly Ser Tyr Ser Ala Asp Arg He 
85 90 95 

He Asn Arg Tyr Thr Gly Thr Val Lys Gly Cys Leu Asn Asn Ser Ala 
100 105 110 

Pro Glu Asp Pro Phe Glu Cys Asn Trp Phe Tyr Cys Cys Ser Ala He 
115 120 125 

Thr Thr Glu He Cys Arg Cys Ser He Thr Asn Val Thr Val Ala Val 
130 135 140 

Gin Thr Phe Pro Pro Phe Net Tyr Cys Ser Phe Ala Asp Cys Ser Thr 
145 150 155 160 

Val Ser Gin Gin Glu Leu Glu Ser Gly Lys Ala Net Leu Ser Asp Gly 
165 170 175 

Ser Thr Leu Thr Tyr Thr Pro Tyr He Leu Gin Ser Glu Val Val Asn 
180 185 190 

Lys Thr Leu Asn Gly Thr He Leu Cys Asn Ser Ser Ser Lys He Val 
195 200 205 

Ser Phe Asp Glu Phe Arg Arg Ser Tyr Ser Leu Thr Asn Gly Ser Tyr 
210 215 220 

Gin Ser Ser Ser He Asn Val Thr Cys Ala Asn Tyr Thr Ser Ser Cys 
225 230 235 240 

Arg Pro Arg Leu Lys Arg Arg Arg Arg Asp Thr Gin Gin I le Glu Tyr 
245 250 255 

Leu Val His Lys Leu Arg Pro Thr Leu Lys Asp Ala Trp Glu Asp Cys 

260 265 270 

Glu He Leu Gin Ser Leu Leu Leu Gly Val Phe Gly Thr Gly He Ala 
275 280 285 

Ser Ala Ser Gin Phe Leu Arg Ser Trp Leu Asn His Pro Asp He He 
290 295 300 

Gly Tyr He Val Asn Gly Val Gly Val Val Trp Gin Cys His Arg Val 
305 310 315 320 

Asn Val Thr Phe Net Ala Trp Asn Glu Ser Thr Tyr Tyr Pro Pro Val 
325 330 335 

Asp Tyr Asn Gly Arg Lys Tyr Phe Leu Asn Asp Glu Gly Arg Leu Gin 
340 345 350 

Thr Asn Thr Pro Glu Ala Arg Pro Gly Leu Lys Arg Val Net Trp Phe 
355 360 365 

Gly Arg Tyr Phe Leu Gly Thr Val Gly Ser Gly Val Lys Pro Arg Arg 
370 375 380 
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It. Arg Tyr Asn ly. Thr Ser HU A«p Tyr HU Leu Glu Glu Phe Glu 
385 390 *^ 

Ala Ser Leu Asn Met Thr Pro Gin Thr Ser He Ala Ser Gly His Glu 
405 410 415 

Thr Asp Pro He Asn His Ala Tyr Gly Thr Gin Ala Asp Leu Leu Pro 
420 *25 430 

Tyr Thr Arg Ser Ser Asn lie Thr Ser Thr Asp Thr Gly Ser Gly Trp 
435 440 445 

Val His lie Gly Leu Pro Ser Phe Ala Phe Leu Asn Pro Leu Gly Trp 
450 455 

Leu Arg Asp Leu Leu Ala Trp Ala Ala Trp Leu Gly Gly Val Leu Tyr 
455 470 475 

Leu He Ser Leu Cys Val Ser Leu Pro Ala Ser Phe Ala Arg Arg Arg 
485 490 495 

Arg Leu Gly Arg Trp Gin Glu 
500 

<10) INFORMATION FOR SEQ ID NO:9: 

<i> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
CD) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 

(iii) HYPOTHETICAL: NO 

(W) ANTI-SENSE: NO 

(jci) SEQUENCE DESCRIPTION: SEO ID NO:9: 

ATGTCATTTC ATGC6AGCCT CCTTCGCGAG CAGGA6ACGC CTCGGCCGCT GGCAGGAATA 60 

AACCGTACCG ACCAGTCTCT TAAAAACCCT CTCCTCGGAA CAGAGGTCTC TTTCTGCCTT 120 

AAGTCGAGCT CACTCCCCCA TCATGTACGA GCACTAGCCC AGATTAAAGC AAGCAACCT6 180 

GCATCCTGTG ACTATTACTT GCTATTCCGC CAAGTTGTAT TGCCCCCTGA AGTATATCCC 240 

ATTGGTGTTC TAATAAGAGC TGCGGAGGCT ATACTAACAG TTATAGTATC AGCTTGGAAG 300 

CTGGATCATA TGACGAAGAC CCTATACTCC TCTGTGAGAT ATGCACTCAC CAATCCCCGG 360 

GTCC6AGCCC AACTT6AGCT TCACATTGCC TACCAGCGCA TAGTGGGTCA GGTCTCGTAC 420 

AGCCGGGA6G CAGACATAGG GCCAAAAAGG CTTGGGAATA TGTCATTGCA ATTCATCCAA 480 

TCTCTCGTTA TTGCCACCAT AGACACGACA AGCTGCCTAA TGACCTACAA CCACTTTCTT 540 

GCTGCAGCAG ACACAGCCAA GAGCAGATGC CATCTCCTAA TCGCCTCAGT GGTCCA6GGG 600 

CCCCTTTGGG AACAAGGGTC ATTTCTTGAT CATATAATCA ACAT6ATCGA CATAATTGAC 660 

TCAATCAACC TCCCCCATGA TGATTACTTC ACAATTATTA AGTCTATCTT TCCCTACTCC 720 

CAAGGGCTT6 TTATGGGGAG GCATAAT6TA TCAGTCTCCT CTGATTTCGC GTCCGTATTT 780 
GCCATTCCTG AATTAT6CCC GCAACTAGAC AGCTTACTAA AAAAACTGCT CCAACTTGAC 
CCCGTTCTCC TCCTCATGGT CTCTTCGGTG CAGAAGTCAT 6GTACTTCCC TCAGATCCGA 

ATGGTCGACG GGTCACGGGA GCAGCTCCAC AAGATGCGTG TCGAGCT6GA AACGCCCCAA 960 
GCCCTGCTCT CGTACGGCCA TACCCTCCTG TCAATATTTC GG6CAGA6TT TATCAAAGGC 



840 
900 



1020 
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TATGTCTCAA AGAATGCGAA GT6GCCCCCC GTACACCT6C TCCCAGGCTG TGACAAATCC 1080 

ATAAAAAATG CGAGAGAGCT SBCC C SC TfiC AGCCCGGCAT TTGACCGACG ATGGCAGCTC 1140 

TTCGAGAAGG TTGTCATTCT AAGAATTGCT GACCTAGATA TGGATCCCGA CTTCAACGAT 1200 

ATTGTTA6CC ATAAGGCGAT AATCAGCTCA A6AA6GGACT GGGTATTCGA GTACAATGCA 1260 

GCGCCCTTTT GGAAGAAATA CGGTGAACGG TTGGAGACGC CTCCTCCCAG GTCGGGACCG 1320 

TCACGACTTG TGAATGCTCT AATCGATGGA CGCTTAGACA ATATCCCAGC CCTGCTAGAG 1380 

CCATTTTACA GGGGAGCGGT TGAGTTCGAG GATCGGTTGA CTGTGCTCGT GCCTAAGGAG 1440 

AAAGAGTTAA AGGTAAAGGG AAGGTTCTTC TCGAAGCAAA CATTGCCAAT CAGGATATAT 1500 

CAGGTTGTTG CTGAAGCTGC ACTTAAGAAT GAGGTTATGC CATACCTAAA GACACACTCA 1560 

ATGACCATGA GCTCAACGGC TCTAACTCAC CTTCTTAACC GGCTATCACA TACTATCACT 1620 

AAGGGTGACT CCTTTGTTAT TAACCTTGAC TATAGTTCCT GGTGCAACGG TTTCCGACCA 1680 

GAACTGCAGG CCCCAATCTG TCGTCAGTTG GATCAGATGT TCAATTGCGG GTACTTCTTC 1740 

AGGACTGGGT 6CACACTCCC ATGCTTTACC ACGTTTATTA TTCAAGACAG GTTCAACCCG 1800 

CCCTATTCCC TCAGTGGTGA GCCCGTTGAA GACGGAGTTA CATGCGCGGT TGGGACTAAA 1860 

ACAATGGGGG AGGGCATGAG GCAGAAACTA TGGACAATCC TTACGAGCTG CTGGGAGATA 1920 

ATTGCTCTTC GGGAAATTAA CGTGACGTTT AACATACTAG GCCAAGGTGA TAATCAGACA 1980 

AT CAT CAT AC ATAAATCTGC AAGCCAAAAT AACCAGCTAT TAGCGGAGCG AGCACTAGGG 2040 

GCCCTGTACA AGCATGCTAG ATTAGCTG6C CATAACCTCA AGGTAGAGGA ATGCTGGGTG 2100 

TCAGATTGTC TGTATGAGTA TGGAAAGAAG CTTTTCTTCC GTGGTGTACC TGTCCCGGGC 2160 

TGTTTGAAGC AGCTCTCACG GGTGACGGAT TCTACTGGAG AGCTATTCCC AAACCTATAC 2220 

TCAAAGTTAG CCTGCTTAAC ATCATCGTGT TTAAGCGCAG CGATGCCAGA CACATCTCCA 2280 

TGGGTGGCAC TCGCGACAGG TGTCTGTCTG TATCTTATCG AGTTATATGT TGAGCTGCCT 2340 

CCAGCAATCA TGCACGATGA GTCGCTATTG ACGACCCTCT GCCTCGTAGG CCCATCCATT 2400 

G6TGGGCTTC CGACCCCTGC AACCCTACCC AGTGTCTTTT TCAGAGGAAT GTCCGACCCA 2460 

CTGCCCTTTC AGCTAGCACT CTTGCAGACC CTCATTAAGA CGACAGCGGT GACCTGTAGC 2520 

TTGGTGAATC GTGTGGTCAA GTTACGGATA GCACCCTATC CAGACTGGCT CTCTCTAGTG 2580 

ACTGACCCGA CCTCACTCAA CATTGCCCAA GTGTACCGGC CAGAACGTCA GATCAGGAGG 2640 

TGGATTGAGG AAGCGATAGC GACAAGCTCA CACTCGTCAC 6CATAGCAAC TTTCTTCCAG 2700 

CAGCCCCTCA CCGAGATGGC TCA6TTGCTT GCGAGGGACC TTTCAACAAT GATGCCTCTT 2760 

CGACCCCGGG ATATGTCGGC CTTATTCGCA TTATCAAATG TC6CATACGG TTTAAGCATT 2820 

ATAGATCTAT TTCAAAAATC CTCTACCGTT GTTTCTGCAA GTCAAGCTGT CCATATCGAG 2880 

GATGTTGCCC TAGAGAGTGT AAGGTATAAG GAATCTATCA TCCAGGGTCT GTTAGACACC 2940 

ACTGAGGGGT ATAACATGCA ACCTTATTTG GAAGGTTGCA CTTACCTTGC A6CCAAACAG 3000 

TTACGTAGGT TGACATGGGG TCGAGACCTA GTTGGAGTCA CAATGCCCTT TGTTGCCGAG 3060 

CAATTCCATC CTCACAGTTC TGTGGGTGCA AAGGCGGAAC TCTACCTCGA CGCTATTATA 3120 

TACTGCCCAC AGGAGACATT GCGGTCACAC CATCTGACTA CCAGGGGGGA CCAGCCGCTT 3180 

TACCTCGGAT CCAATACGGC TGTCAAGGTC CAGCGAGGTG AGATCACGGG CCTAACAAAG 3240 
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TCAAGCGCTG CAAATCTAGT CAG6GACACT CTCGTTCTCC ATCAGTGGTA TAAAGTCCGT 3300 
AAAGTTACCG ATCCACACTT GAACACCCTC ATGGCACGCT TCTTACTTGA GAAGGGGTAC 3360 
ACATCTGACO CTCGACCTAS CATCCAG6GT GGGACCCTCA CGCATCCTCT CCCATCCCCC 3420 
GGAGACTCAC GGCAGGGGCT TACTGGGTAT GTAAATATAC TAAOTACOTG OCTTCOATTC 3*80 
TCAAGTGATT ATCTTCACTC TTTCTCGAAA TCATCAGACG ACTATACAAT CCACTTTCAG 35*0 
CATGTATTCA CATACGGTTG CCTCTATGCT GATTCGGTGA TTAGATCC6G CGGTGTTATT 3600 
TCCACTCCTT ACCTTTTGAG TGCAAGTTGT AAAACATGCT TTGAGAAGAT AGACTCAGAG 3660 
GAGTTCGTCC TGGCATGTGA ACCCCAATAC AGGGGTGCTG AGT6GCTGAT ATCAAAGCCA 3720 
GTCACTGTCC CTGAGCAGAT AACTGATGCT GAAGTCGAGT TTGACCCCTG TGTGAGTGCG 3780 
GGTTATTGTC TCGGGATTCT CATTGGCAAG TCATTCTTAG TTGACATAAG GGCAAGTGGG 38*0 
CATGATATCA TGGAGCAGCG GACATGGGCT AACCTGGAGA GGTTTTCTGT ATCGGACATG 3900 
CAGAAACTTC CGTGGAGTAT TGTAATTCGG TCTCTCTGGA GATTCCTTAT TGGCGCACGG 3960 
CTCCTTCAGT TTGAGAAGGC TGGCCTCATT AGAATGCTGT ATGCTGCGAC AGGTCCAACC 
CCTAGCTTCC TAATGAAAGT TTTTCAAGAC TCAGCCCTCC TCATGGACTG CGCACCCCTC 
GATCGGCTGT CCCCTAGGAT CAACTTTCAT AGTCGGGGAG ACCTCGTTGC TAAGCTTGTT 
TTATTGCCCT TCATCAACCC 6GGTATAGTG GAGATTGAAG TGTCTGGAAT TAATAGCAAG 
TACCATGCAG TATCGGAGGC CAATATGGAT CTGTACATCG CTGCTGCCAA GTCTGTGGGC 
CTGAAGCCCA CACAGTTTGT TGAGGAAACA AACGACTTTA CGGCCCGCGG CCACCACCAT 
G6TTGTTATT CCCTTTCTTG GTCTAAGTCA CGCAATCAAT CACAG6TCCT AAAGATGGTA *380 
GTACGGAAGC TGAAGCTCTG TGTCCT6TAT ATATACCCCA CAGTCGATCC CGCCGTTGCT 
CTCGACCTGT GCCATCTACC AGCATTAACT ATAATCCTAG TGCTCGGCGG TGACCCAGCG 
TACTATGAGC GATTACTTGA GATGGACCTG TGCGGGGCTG TGTCAA6TCG AGTCGATATC 
CCCCATTCTC TGGCTGGCAG AACCCACAGG GGGTTCGCAG TGGGCCCAGA C6CTGGTCCA 
GGTGTAATTA GACTCGACAG GTTAGAGTCA GTT7GTTATG CTCACCCCTG TTTAGAGGAA 
CTAGAGTTTA ATGCATATCT AGACTCTGAG TTGGTTGACA TTAGTGATAT GTGCTGCCTC 
CCCTTAGCSA CACCCTGTAA GGCCCTTTTC AGGCCAATAT ATCGGAGCTT ACAGTCGTTC 
AGGTTAGCCT TAATGGACAA CTATAGTTT7 GTCATGGACC TCATTATGAT CCGAGGACTG 
CACATTAGGC CTCACCTTGA GGAATTTGAC GAGCTGCTTG TGGTAGGACA 6CACATCCTC 
CGCCAGCCCG TCCTAGTAGA GGTTGTTTAC TACGTTGGAG TTGTTAGGAA GCGCCCTGTG 
TTAGCGAGGC ATCCGTGGTC AGCAGATCTT AAGCGAATTA CTGTGGGGGG GCGGGCTCCC 
TGCCCCTCTG CTGCCAGATT GCGTGATGAG GATTGTCAGG GGTCTCTGTT GGTTGGGCTT 
CCTGCTGGGT TGACGCAGTT ATTGATAATT GATTA 

(11) INFORMATION FOR GEO ID 110:10: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1711 «<no acid* 

(B) TYPE: mivo NCtd 

(C) ST RAW) ED HESS: tingle 
(Dl TOPOLOGY: unknoMi 

(H) MOLECULE TYPE: prot.ln 



4020 
4080 
41*0 
4200 
4260 
4320 



4440 
4500 
4560 
4620 
4680 
4740 
4800 



4920 
4980 
5040 
5100 
5135 



WO 96/21020 



PCIYUS96700418 



131 



<xi> SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met Ser Phe His Ala Ser Leu Leu Arg Glu Glu Glu Thr Pro Arg Pro 
15 10 15 

Val Ala Gly He Asn Arg Thr Asp Gin Ser Leu Lys Asn Pro Leu Leu 
20 25 30 

Gly Thr Glu Val Ser Phe Cys Leu Lys Ser Ser Ser Leu Pro His His 
35 40 45 

Val Arg Ala Leu Gly Gin lie Lys Ala Arg Asn Leu Ala Ser Cys Asp 
50 55 60 

Tyr Tyr Leu Leu Phe Arg Gin Val Val Leu Pro Pro Glu Val Tyr Pro 
65 70 75 80 

He Gly Val Leu lie Arg Ala Ala Glu Ala lie Leu Thr Val He Val 
85 90 95 

Ser Ala Trp Lys Leu Asp Nis Net Thr Lys Thr Leu Tyr Ser Ser Val 
100 105 110 

Arg Tyr Ala Leu Thr Asn Pro Arg Val Arg Ala Gin Leu Glu Leu His 
115 120 125 

He Ala Tyr Gin Arg He Val Gly Gin Val Ser Tyr Ser Arg Glu Ala 
130 135 140 

Asp He Gly Pro Lys Arg Leu Gly Asn Net Ser Leu Gin Phe He Gin 
145 150 155 160 

Ser Leu Val He Ala Thr He Asp Thr Thr Ser Cys Leu Net Thr Tyr 
165 170 175 

Asn His Phe Leu Ala Ala Ala Asp Thr Ala Lys Ser Arg Cys His Leu 
180 185 190 

Leu He Ala Ser Val Val Gin Gly Ala Leu Trp Glu Gin Gly Ser Phe 
195 200 205 

Leu Asp His He He Asn Net He Asp He He Asp Ser He Asn Leu 
210 215 220 

Pro His Asp Asp Tyr Phe Thr He He Lys Ser He Phe Pro Tyr Ser 
225 230 235 240 

Gin Gly Leu Val Net Gly Arg His Asn Val Ser Val Ser Ser Asp Phe 
245 250 255 

Ala Ser Val Phe Ala He Pro Glu Leu Cys Pro Gin Leu Asp Ser Leu 
260 265 270 

Leu Lys Lys Leu Leu Gin Leu Asp Pro Val Leu Leu Leu Net Val Ser 
275 280 285 

Ser Val Gin Lys Ser Trp Tyr Phe Pro Glu He Arg Net Val Asp Gly 
290 295 300 

Ser Arg Glu Gin Leu His Lys Net Arg Val Glu Leu Glu Thr Pro Gin 
305 310 315 320 

Ala Leu Leu Ser Tyr Gly His Thr Leu Leu Ser He Phe Arg Ala Glu 



Phe He Lys Gly Tyr Val Ser Lys Asn Ala Lys Trp Pro Pro Val His 
340 345 350 

Leu Leu Pro Gly Cys Asp Lys Ser He Lys Asn Ala Arg Glu Leu Gly 
355 360 365 

Arg Trp Ser Pro Ala Phe Asp Arg Arg Trp Gin Leu Phe Glu Lys Val 



325 



330 



335 



370 



375 



380 
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V.l lie Leu Arg lie Ale A«p Leu Asp Net Atp Pro Asp Phe A«n Asp 
j35 390 

tie V.l Ser Asp Lye Ala lie lie Ser Ser Arg Arg Asp Trp Vel Phe 
405 *10 *" 

Glu Tyr Asn Ale Ale Ale Phe Trp lys Lys Tyr 6ly Glu Arg leu Glu 

420 425 
Arg Pro Pro Ale Arg Ser Gly Pro Ser Arg Leu Vel Asn Ale Leu lie 
435 **° . 

Asp Gly Arg Leu Asp Asn lie Pro Ale Leu Leu Glu Pro Phe Tyr Arg 
450 *'5 4*0 

Gly Ale Vel Glu Phe Glu Asp Arg Leu Thr Vel Leu Vol Pro Lys Glu 
465 470 475 

Lys Glu Leu Lys Vel Lys Gly Arg Phe Phe Ser Lye Gin Thr Leu Ale 
435 490 

lie Arg lie Tyr Gin V.l Vel Ale Glu Ale Ale Leu Lys Asn Glu Vel 

500 505 

Met Pro Tyr Leu Lys Thr His Ser Net Thr Net Ser Ser Thr Ale Leu 
515 520 

Thr His Leu Leu Asn Arg Leu Ser His Thr He Thr lys Gly Asp Ser 
530 535 5*0 

Phe Vel lie Asn Leu Asp Tyr Ser Ser Trp Cys Asn Gly Phe Arg Pro 
545 550 55 » 

Glu Leu Gin Ala Pro lie Cys Arg Gin Leu Asp Gin Net Phe Asn Cys 
565 570 *»» 

Gly Tyr Phe Phe Arg Thr Gly Cys Thr Leu Pro Cys Phe Thr Thr Phe 
580 '85 3W 

lie lie Gin Asp Arg Phe Asn Pro Pro Tyr Ser Leu Ser Gly Glu Pro 
595 600 ous 

V.l Glu Asp Gly Vel Thr Cys Ale Vel Gly Thr Lys Thr Net Gly Glu 
610 615 620 

Gly Net Arg Gin Lys Leu Trp Thr lie Leu Thr Ser Cys Trp Glu lie 
625 630 635 o*u 

lie Ala Leu Arg Glu lie Asn Vel Thr Phe Asn lie Leu Gly Gin Gly 
645 650 003 

Asp Asn Gin Thr lie lie lie His Lys Ser Ala Ser Gin Asn Asn Gin 
660 

Leu Leu Ale Glu Arg Ale leu Gly Ale Leu Tyr Lys His Ale Arg Leu 

675 "0 60 

Ala Gly His Asn Leu Lys Vol Glu Glu Cys Trp Vel Ser Asp Cys Leu 

Tyr Glu Tyr Gly Lys Lys Leu Phe Phe Arg Gly V.l Pro V.l Pro Gly 
705 710 7,5 

Cys Leu Lys Gin Leu Ser Arg Vel Thr Asp Ser Thr Gly Glu Leu Phe 
' 725 730 

Pro Asn Leu Tyr Ser Lys Leu Ala Cys leu Thr Ser Ser Cys Leu Ser 
740 745 ™» 

Ala Ala Net Ale Asp Thr Ser Pro Trp V.l Ala Leu Ale Thr Gly V.l 

755 7 » 765 

Cys Leu Tyr leu He Glu leu Tyr Vel Glu Leu Pro Pro Ale He Net 
770 775 
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Gin Asp Glu Ser Leu Leu Thr Thr Leu Cys Leu Val Gly Pro Ser tie 
785 790 795 800 

Gly Gly Leu Pro Thr Pro Ala Thr Leu Pro Ser Val Phe Phe Arg Gly 
805 810 815 

Net Ser Asp Pro Leu Pro Phe Gin Leu Ala Leu Leu Gin Thr Leu lie 
820 825 830 

Lys Thr Thr Gly Val Thr Cys Ser Leu Val Asn Arg Val Val Lys Leu 
835 840 845 

Arg lie Ala Pro Tyr Pro Asp Trp Leu Ser Leu Val Thr Asp Pro Thr 
850 855 860 

Ser Leu Asn lie Ala Gin Val Tyr Arg Pro Glu Arg Gin lie Arg Arg 
865 870 875 880 

Trp He Glu Glu Ala He Ala Thr Ser Ser His Ser Ser Arg He Ala 
885 890 895 

Thr Phe Phe Gin Gin Pro Leu Thr Glu Net Ala Gin Leu Leu Ala Arg 
900 905 910 

Asp Leu Ser Thr Net Net Pro Leu Arg Pro Arg Asp Net Ser Ala Leu 
915 920 925 

Phe Ala Leu Ser Asn Val Ala Tyr Gly Leu Ser He He Asp Leu Phe 
930 935 940 

Gin Lys Ser Ser Thr Val Val Ser Ala Ser Gin Ala Val His He Glu 
945 950 955 960 

Asp Val Ala Leu Glu Ser Val Arg Tyr Lys Glu Ser He He Gin Gly 
965 970 975 

Leu Leu Asp Thr Thr Glu Gly Tyr Asn Net Gin Pro Tyr Leu Glu Gly 
980 985 990 

Cys Thr Tyr Leu Ala Ala Lys Gin Leu Arg Arg Leu Thr Trp Gly Arg 
995 1000 1005 

Asp Leu Val Gly Val Thr Net Pro Phe Val Ala Glu Gin Phe His Pro 
1010 1015 102) 

His Ser Ser Val Gly Ala Lys Ala Glu Leu Tyr Leu Asp Ala He lie 
1025 1030 1035 1040 

Tyr Cys Pro Gin Glu Thr Leu Arg Ser His His Leu Thr Thr Arg Gly 
1045 1050 1055 

Asp Gin Pro Leu Tyr Leu Gly Ser Asn Thr Ala Val Lys Val Gin Arg 
1060 1065 1070 

Gly Glu He Thr Gly Leu Thr Lys Ser Arg Ala Ala Asn Leu Val Arg 
1075 1080 1083 

Asp Thr Leu Val Leu His Gin Trp Tyr Lys Val Arg Lys Val Thr Asp 
1090 1095 1100 

Pro His Leu Asn Thr Leu Net Ala Arg Phe Leu Leu Glu Lys Gly Tyr 
1105 1110 1115 1120 

Thr Ser Asp Ala Arg Pro Ser He Gin Gly Gly Thr Leu Thr Hit Arg 
1125 1130 1135 

Leu Pro Ser Arg Gly Asp Ser Arg Gin Gly Leu Thr Gly Tyr Val Asn 
1140 1145 1150 

He Leu Ser Thr Trp Leu Arg Phe Ser Ser Asp Tyr Leu His Ser Phe 
1155 1160 1165 



Ser Lys Ser Ser Asp Asp Tyr Thr He His Phe Gin His Val Phe Thr 
1170 1175 1180 
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Tyr Oly Cy. uu Tyr *J* •» * l "« »• ^ Gly Gly W * 1 &o 
1185 1190 

Ser Thr Pro Tyr le^Leu S.r Al. Ser Cy^Ly. Thr Cy. Phe Glu^ys 

lie Asp Ser Glu Glu Phe V.l Leu Ale Cy. Clo Pro Gin Tyr Ar, Gly 
1220 1225 ,fcJM 

U» Glu Trp Leu lie Ser ly. Pro V.l Thr W.l Pro Glu Gin He Thr 
1235 12*0 '«*' 

Asp Al. Glu V.l Glu Phe Aap Pro Cy. Vol Ser AU Gly Tyr Cy. Leu 
1255 mou 



1250 

Gly ,le Leu lie Gly Ly. Ser Phe Leu V.l Agile Ar, Al. S.r Gly^ 
1265 1270 



Hi. Asp lie net Gl^Gln Arg Thr Trp Al;**, Leu Glu Ar, PheSer 
vel Ser Asp HetGln Ly. Leu Pro Trp^Ser lie V.l lie Ar»Ser Leu 



1300 

Tro Ar9 Phe Leu lie Gly Al. Ar. Leu Leu Gin Ph. Glu Ly. Al. Gly 
1315 1320 

Al. Al. Thr Gly Pro Thr P 
1330 1»5 13W 

Het Ly. V.l Phe Gin Asp Ser Al. Leu Leu Net Asp Cy. Al. Pro Leu^ 
1345 1350 1333 

Asp Ar, 1«i ser Pro Ar, II. Aw Phe His .Ser Ar, Gly Asp L^V.l 
1365 ijfu 

M. Ly. im V.l Leu Leu Pro Phe lie A.n Pro Gly II. V.MSlu II. 
1380 1385 13yu 

Glu V.l Ser Gly lie Asn Ser Ly. Tyr HI. Al. V.l Ser Glu Al. ton 

1395 1*M U05 

Net Asp Leu Tyr II. Al. Al. Al. Ly. Ser V.l GlyV.l Ly. Pro Thr 

1410 M15 1W ™ 

Gin Ph. V.l Glu Glu Thr Asn Asp Phe Thr M. Ar, Gly Hi. His Hi^ 
1425 1*5^ 

Gly Cy. Tyr S.r Leu^r Trp Ser Ly. Ser^Ar, Aen Gin Ser Gl^V.l 
Leu Ly. net V.l.V.1 Ar, Ly. Leu LyjL.u Cy. V.l Leu TyrU. Tyr 



Leu 



tie Arg Net Leu Tyr Al. Al. Thr Gly Pro Thr Pro Ser Phe Leu 



1460 

,_. Asp P 
1475 



Pro Thr V L Asp Pro Al. V.l MiUU Asp leu Cy. Hi^Leu Pro Al. 



leu Thr II. He Leu V.l Leu Gly Gly Asp Pro M. Tyr Tyr Glu Ar, 

1490 1* 95 1WU 

t«u L.u Glu Mt A* leu Cy. Gly Al. V.l Ser Ser Ar, V.l top 
t505 I 510 

Pro Hi. Ser leu Al. Gly Ar, Thr Hi. Ar, Gly Ph. Al. v.l Gly^Pro 

1525 13JU 
A*> M. Gly ProGly V.l I le Ar, . wftp Ar, L«u Glu Se^V.l Cy. 

Tyr Al. Hi. Pro Cy. Leu Glu Glu Leu Glu Phe Asn Al^Tyr Leu Asp 

1555 1560 
S.r Glu Leu V.l Asp lie Ser Asp net Cys Cys L.u Pro leu Al. Thr 
1570 1575 
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Pro Cys Lys Ale Leu Phe Arg Pro lie Tyr Arg Ser Leu Gin Ser Phe 
1585 1590 1595 1600 



Arg Leu Ala Leu Net Asp Asn Tyr Ser Phe Vel Met Asp Leu lie Net 
1605 1610 1615 



He Arg Gly Leu Asp lie Arg Pro His Leu Glu Glu Phe Asp Glu Leu 
1620 1625 1630 

Leu Vol Val Gly Gin His lie Leu Gly Gin Pro Val Leu Val Glu Val 
1635 1640 1645 

Val Tyr Tyr Val Gly Val Val Arg Lys Arg Pro Val Leu Ala Arg His 
1650 1655 1660 

Pro Trp Ser Ala Asp Leu Lys Arg lie Thr Val Gly Gly Arg Ala Pro 
1665 1670 1675 1680 

Cys Pro Ser Ala Ala Arg Leu Arg Asp Glu Asp Cys Gin Gly Ser Leu 
1685 1690 1695 

Leu Val Gly Leu Pro Ala Gly Leu Thr Gin Leu Leu lie He Asp 
1700 1705 1710 



(12) INFORMATION FOR SE0 ID NO: 11: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(If) MOLECULE TYPE: cONA 
(fit) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(Xf) SEQUENCE DESCRIPTION: SEQ ID N0:11: 
GCCTCCCCTT AGCGACACCC TGTA 24 



(13) INFORMATION FOR SO ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS: single 

(D) TOPOLOGY: linear 

(11) MOLECULE TYPE: cDNA 
(ill) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:12: 
GAAACATATC GCGCCGTGCA 20 



(14) INFORMATION FOR SEQ ID NO:13: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNES5: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA 
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(iif> HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xl) SEQUENCE DESCRIPTION: SEQ ID 1(0:13: 
TACGTTGGAG TTCTTAGGAA CC 



(15) INFORMATION FOR SEO 10 110:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base psirs 

(B) TYPE: nucleic acid 

(C) STRANDED NESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 
(HO HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEO ID NO:H: 
GAGCTTAGGG AGGCTCGCTG 



(16) INFORMATION FOR SEO ID M0:15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) ST HANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(Iv) ANTI -SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID 110:15: 
TCCTCGAGAT GAATTCAAAA CATTCCTATC 



(17) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:16: 
CTAAGGCCCT GAA6ATCGAA T 



(16) INFORMATION FOR SEQ ID N0:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii> MOLECULE TYPE: CONA 
. (Mi) HYPOTHETICAL: NO 
(Iv) ANTI-SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEO ID NO: 17: 
CCCTCGAGGA CCAAGATTT 19 

(19) INFORMATION FOR SEO ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 25 base pairs 
(B> TYPE: nucleic acid 

(C) STRANDS) NESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(in) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEO ID N0:18: 
AGAATCATAT GGCAACGCGA CCATC 25 

(20) INFORMATION FOR SEQ ID NO: 19: 

Ci) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8910 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL: NO 



(iv) ANTI-SENS: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID 110:19: 



GTTGCGTTAA 


CAACAAACCA 


CTCATCATTC 


TTCTAACAAA ATGAACACAC GCAATGCCAC 


60 


CCAAGAGACG 


CCTGGTTGAT 


GACGCCGATG 


CCATGGAGGA TCAAGATCTA TATGAACCCC 


120 


CA GCGAGC C T 


CCCTAAGCTC 


CCTGGGAAAT 


TCCTACAATA CACCGTTGGG GGGTCTGACC 


160 


CGCAT CC G GG 


TATAGGGCAT 




TCAGGCAGAA CGCAGTGGCA TTGTTAGACC 


240 


AGTCACGGCG 


CGATATGTTT 


CACACAGTAA 


CGCCTAGCCT TGTGTTTCTA TGTTTGCTAA 


300 


TCCCAGGACT 


GCACGCTGCG 


TTTGTTCACG 


GAGGGGTGCC TCGTGAATCC TACCTGTCGA 


360 


CGCCTGTCAC 


GCGTGGAGAA 


CAGACTGTTG 


TTAAGACTGC GAAGTTTTAC GGGGAAAAGA 


420 




TGATCTCACC 


GAGCTGGAGA 


TCTCCTCTAT CTTCAGCCAT TGTTGCTCAT 


480 


TACTAATAGG 


GGTTGTGATA 


GGATCGTCGT 


CTAAGATCAA AGCAGGAGCC GAGCAGATCA 


540 


AGAAAAGGTT 


TAAAACTATG 


ATGGCAGCCT 


TAAACCGGCC ATCCCATGGT GAGACTGCTA 


600 


CACTACTCCA 


GATGTTTAAT 


CCACATGAGG 


CTATAGATTG GATTAACGGC CAACCCTGGG 


660 


TAGGCTCCTT 


TGT6TTGTCT 


CTACTAACTA 


CAGACTTTGA GTCCCCAGGT AAAGAATTTA 


720 


TGGACCAGAT 


TAAGCTTGTC 


GCAAGTTATG 


CACAGATGAC TACGTACACT ACTATAAAGG 


780 


AGTACCTCGC 


AGAATGCATG 


GATGCTACCC 


TTACAATCCC CGTAGTTGCA TATGAGATCC 


840 
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GTGACTTTTT AGAAGTTTCA GCAAAGCTTA AGGAGGATCA TGCTGACCTG TTCCCGTTTC 900 
TGGCGOCCAT TAGACACCCC GACGCTATCA AGCTGGCGCC ACGAAGCTTT CCCAATCTGG 960 
CCTCCGCAGC GTTTTACTGG AGTAAGAAGG AAAACCCCAC AATGGCAGGC TACCGGGCCT 1020 
CCACCATCCA GCCGGGCGCA AGTOTCAAGG AAACCCACCT TGCCCCGTAT AGGCGCCGCG 1080 
AGATATCTCG TGCAOAGGAC GGGGCAGA6C TCTCAGCTGA GATCTCTGCC ATAATGAAGA 1140 
TGATAG6TGT GACTGGTCTA AACTAAAAAA CAATGAACAA ACCAATAAAA AACCAAATGC 
GGCAAACCCT CCGCGACCTG CGATGAGCTC CGACCTCCGG CTGACATTGC TTGAACTAGT 
CAGGAGGCTC AATGGCAACG CGACCATCGA GTCTGGTCGA CTCCCTGGAG GACGAA6AAG 
ATCCCCAGAC ACTACGACGG GAACGACCGG GGTCACCAAG ACCACGGAAG GTCCCAAGGA 
ATGCATTGAC CCAACCAGTA GACCAGCTCC TGAAGGACCT CAGGAAGAAC CCCTCCATGA 
TCTCAGACCC AGACCAGCGA ACCGGAAGGG AGCAGCTGTC GAATGATGAG CTAATCAAGA 1500 
AGTTAGTGAC GGAGCTGGCC GAGAATAGCA TGATCGAGGC TGAGGAGGTG CGGGGCACTC 
TTGGAGACAT CTCGGCTCGT ATCGAGGCAG GGTTTGAGTC CCTGTCCGCC CTCCAAGTGG 
AAACCATCCA GACAGCTCAG CGGTGCGATC ACTCCGACAG CATCAG6ATC CTCGGCGAGA 
ACATCAAGAT ACTAGATCGC TCCATGAAGA CAATGATGGA GACAA1GAAG CTCATGATGG 
AGAAGGTGGA TCTCCTCTAC GCATCAACCG CCGTTOGOAC CTCTGCACCC ATGTTGCCCT 
CCCATCCTGC ACCTCCBXC ATTTATCCCC AGCTCCCAAG TGCCCCGACA ACGGATGAAT 
GGGACATCAT ACCATAAAAA AATCGAATCA CCATGAATTC AAAACATTCC TATGTGGAGC 1920 
TCAAGGACAA GGTAATCGTC CCTGGATGGC CCACACTGAT GCTTGAGATA GACTTTGTAG 1980 
GGGGGACTTC ACGGAACCAG TTCCTTAACA TCCCATTTCT TTCAGTGAAA GAGCCTCTGC 20*0 
AGCTTCCACG CGAGAAGAAG TTGACCGACT ACTTTAC7AT TGACGTAGAA CCAGCAGGTC 2100 
ATTCCCTGGT CAATATATAC TTCCAGATTG ACGACTTCTT 6CTCCTAACA CTCAACTCAC 2160 
TATCTGTGTA CAAGGACCCG ATTAGAAAAT ACATGTTCCT ACGCCTCAAC AAGGACCAGA 2220 
GCAAGCACGC AATCAATGCA GCCTTCAATG TCTTTTCTTA TCGGCTTCGG AACATTGGTG 2280 
TTGGTCCTCT CGGCCCGOAC ATTCGATCTT CAGGGCCTTA GCTGCAATAC TGACTCCACT 2340 
CCTGGACTGA TTGACCTGGA GATAAGGCGA CTTTGCCACA CCCCAACGGA AAATGTCATT 
TCATGCGAGG TTAGTTATCT CAACCACACG ACTATTAGCC TCCtGGCAGT CCACACATCA 
TGCCTCAAGT ACCACTGCAA AACCTATTGG GGATTCTTTG GTAGCTACAG CGCTGACCOA 
ATCATAAATC GGTACACTGG TACTGTTAAG GGTTGTCTAA ACAACTCAGC ACCAGAGGAC 2580 
CCCTTCGAGT GCAACTGGTT CTACTGCTGC TC6GCGATTA CAACAGAGAT CTGCCGATGC 
TCTATTACAA ATGTCACGGT GGCTGTGCAA ACATTCCCAC CGTTCATGTA CTCCAGTTTT 
GCAGACTGCA GTACCGTGAG CCAACAGGAG CTAGAGAGTG GAAAGGCAAT GCTGAGCGAT 
GGCAGTACAT TAACTTATAC CCCGTATATC CTACAGTCAG AAGTCGTGAA CAAAACCCTC 2820 
AATGGGACCA TACTCTGCAA CTCATCCTCT AAGATAGTTT CCTTCGATGA ATTIACGCGT 2880 
TCATACTCCC TAACGAATGG TAGTTACCAG AGCTCATCAA TCAATGTGAC GTGTGCAAAC 2940 
TACACGTCGT CCTGCCGGCC CAGGTTGAAA AGGCGGCGTA GGGACACCCA GCAGATTGAG 3000 
TATCTAGTTC ACAAGCTTAG GCCCACACTG AAAGATGCAT GGGAGGACTG TGAGATCCTC 3060 
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CAGTCTCTGC TCCTAGGGGT GTTTG6TACT CGGATCGCAA CTGCTTCTCA ATTTT7GAGG 3120 

AGCTGGtTCA ACCACCCTGA CATCATCGGG TATATAGTTA ATGGAGTTGG CGTTGTCTGG 3180 

CAATGCCATC GTGTTAATGT CACCTTCATG GCGTGGAATG ACTCCACCTA TTACCCTCCA 3240 

GTAGATTACA ATGGCCGGAA GTACTTCCTG AATGATGAGG GAAGGTTACA AACAAACACC 3300 

CCCGAGGCAA GGCCAGGCCT TAAGCGGGTC ATGTGGTTCG GCAGGTACTT CCTAGGGACA 3360 

GTAGGGTCTG GGGTGAAACC GAGGAGGATT CGGTACAATA AGACCTCACA TGACTACCAC 3420 

CTGGAGGAGT TTGAGGCAAG TCTCAACATG ACCCCTCAGA CCAGTATCGC CTCCGGTCAT 3480 

GAGACAGACC CCATAAATCA TGCCTACGGA ACGCAGGCTG ATCTCCTTCC ATACACCAGG 3540 

TCTAGTAATA TAACATCTAC GGATACAGGC TCAGGCTGGG TCCACATCGG CCTACCCTCA 3600 

TTT6CTTTCC TCAATCCCCT CGGGTGGCTC AGGGACCTAC TTGCATGGGC AGCCTGGTTG 3660 

GGTGGGGTTC TATACTTAAT AAGTCTTTGT GTTTCCTTAC CAGCCTCCTT CGCGAGGAGG 3720 

AGACGCCTCG GCCGGTGGCA GGAATAAACC GTACCGACCA GTCTCTTAAA AACCCTCTCC 3780 

TCGGAACAGA GGTCTCTTTC TGCCTTAAGT CGAGCTCACT CCCCCATCAT GTACGAGCAC 3840 

TAGGCCAGAT TAAAGCAAGG AACCTGGCAT CCTGTGACTA TTACTTGCTA TTCCGCCAAG 3900 

TTGTATTGCC CCCTGAAGTA TATCCCATTG GTGTTCTAAT AAGAGCTGCG GAGGCTATAC 3960 

TAACAGTTAT AGTATCAGCT TGGAAGCTGG ATCATATGAC GAAGACCCTA TACTCCTCTG 4020 

TGAGATATGC ACTCACCAAT CCCC6GGTCC CAGCCCAACT TGAGCTTCAC ATTGCCTACC 4080 

AGCGCATAGT GGGTCAGGTC TCGTACAGCC GGGAGGCAGA CATAGGGCCA AAAAGGCTTG 4140 

GGAATATGTC ATTGCAATTC ATCCAATCTC TCGTTATTGC CACCATAGAC ACGACAAGCT 4200 

GCCTAATGAC CTACAACCAC TTTCTTGCTG CAGCAGACAC AGCCAAGAGC AGATGCCATC 4260 

TCCTAATCGC CTCAGTGGTC CAGGGGGCCC TTTGGGAACA AGGGTCATTT CTTGATCATA 4320 

TAATCAACAT GATCGACATA ATTGACTCAA TCAACCTCCC CCATGATGAT TACTTCACAA 4380 

TTATTAAGTC TATCTTTCCC TACTCCCAAG GGCTTGTTAT GGGGAGGCAT AATGTATCAG 4440 

TCTCCTCTGA TTTCGCGTCC GTATTTGCCA TTCCTGAATT ATGCCCGCAA CTAGACAGCT 4500 

TACTAAAAAA ACTGCTCCAA CTTGACCCCG TTCTCCTCCT CATGGTCTCT TCGGTGCAGA 4560 

AGTCATGGTA CTTCCCTGAG ATCCGAATGG TCGACGGGTC ACGGGAGCAG CTCCACAAGA 4620 

TGCGTGTCGA GCTGGAAACG CCCCAAGCCC T6CTGTCGTA CGGCCATACC CTCCTGTCAA 4680 

TATTTCGGGC AGAGTTTATC AAAGGCTATG TCTCAAAGAA TGCGAAGTGG CCGCCCGTAC 4740 

ACCTGCTCCC AGGCTGTGAC AAATCCATAA AAAATGCGAG AGAGCTGGGC CGCTGGAGCC 4800 

CGGCATTTGA CCGACGATGG CAGCTCTTCG AGAAGGTTGT CATTCTAAGA ATTGCTGACC 4860 

TAGATATGGA TCCCGACTTC AACGATATTG TTAGCGATAA GGCGATAATC AGCTCAAGAA 4920 

GGGACTGGGT ATTCGAGTAC AATGCAGCGG CCTTTTGGAA GAAATACGGT GAACGGTTGG 4980 

AGAGGCCTCC TGCCAGGTCG GGACCGTCAC GACTTGTGAA TGCTCTAATC GATGGACGCT 5040 

TAGACAATAT CCCAGCCCTG CTAGAGCCAT TTTACAGGG6 AGCGGTTGAG TTCGAGGATC 5100 

GGTTGACTGT GCTCGTGCCT AAGGAGAAAG AGTTAAAGGT AAAGGGAAGG TTCTTCTCGA 5160 

AGCAAACATT GGCAATCAGG ATATATCAGG TTGTTGCTGA AGCTGCACTT AAGAATGAGG 5220 

TTATGCCATA CCTAAAGACA CACTCAATGA CCATGAGCTC AACCGCTCTA ACTCACCTTC 5280 
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TTAACCGGCT ATCACATACT ATCACTAAGG 6TCACTCCTT TGTTATTAAC CTT6ACTATA 55*0 
GTTCCTGGTG CAACGSTTTC CGACCAGAAC TGCAGCCCCC AATCTGTCGT CAGTTGSATC 5400 
AGATGTTCAA TTGCGGGTAC TTCTTCAGGA CTGGGTGCAC ACTGCCATGC TTTACCACGT 5460 
TTATTATTCA AGACAGGTTC AACCCGCCCT ATTCCCTCAG TGCTGACCCC GTTGAAGACG 5520 
GA6TTACATG CGCGGTTGGG ACTAAAACAA TGGGGGAGGG CATGAGGCAG AAACTATGGA 5580 
CAATCCTTAC GAGCTGCTGG GAGATAATTG CTCTTCGGGA AATTAACGTG ACGTTTAACA 5640 
TACTAGGCCA AGGTGATAAT CAGACAATCA TCATACATAA ATCTGCAAGC CAAAATAACC 5700 
AGCTATTAGC GGAGCGAGCA CTAGGGGCCC TGTACAAGCA TGCTAGATTA GCTGGCCATA 5760 
ACCTCAAOCT AGAGGAATGC TGC6TGTCA6 ATTGTCTOTA TGAGTAT66A AAGAACCTTT 5820 
TCTTCCGTGG TGTACCTGTC CCGGGCTGTT TGAAGCAGCT CTCACGGGTG AC6GATTCTA 5880 
CTGGAGAGCT ATTCCCAAAC CTATACTCAA AGTTAGCCTG CTTAACATCA TCGTGTTTAA 
CCGCA0C6AT GGCAGACACA TCTCCAT6G6 T6GCACTCGC GACAGGTGTC TGTCTGTATC 
TTATCGAGTT ATATGTTGAG CTGCCTCCAG CAATCATGCA GGATGAGTCG CTATTGACGA 
CCCTCTGCCT CGTAGGCCCA TCCATTGGTG GGCTTCCGAC CCCTGCAACC CTACCCAGTG 
TCTTTTTCAG AGGAATGTCC GACCCACTGC CCTTTCAGCT AGCACTCTTG CAGACCCTCA 
TTAAGACGAC AGGGGTGACC TGTAGCTTGG TGAATCGTGT GGTCAAGTTA CG6ATAGCAC 
CCTATCCAGA CTGGCTCTCT CTAGTGACTO ACCCSACCTC ACTCAACATT CCCCAAOTGT 
ACCGGCCAGA ACGTCAGATC AGGAGCTGGA TTGAGGAAGC GATAGC6ACA AGCTCACAC7 
CGTCACGCAT AGCAACTTTC TTCCAGCAGC CCCTCACGGA GATGGCTCAG TTGCTTGCGA 
G6GACCTTTC AACAATGATG CCTCTTCGAC CCCGGGATAT CTCGGCCTTA TTCGCATTAT 
CAAATGTCGC ATACGGTTTA AGCATTATAG ATCTATTTCA AAAATCCTCT ACCGTTGTTT 
CTGCAAGTCA AGCTGTCCAT ATCGA66ATG TTOCCCTAGA GAGTGTAAGG TATAAGGAAT 
CTATCATCCA GGGTCTGTTA GACACCACTG AG6GGTATAA CATGCAACCT TATTTGCAAG 
GTTGCACTTA CCTTGCAGCC AAACAGTTAC GTAGGTTGAC ATGGGGTCGA GACCTACTTG 
GAGTCACAAT GCCGTTTGTT GCCGAGCAAT TCCATCCTCA CAGTTCTGTG GGTGCAAAGG 
CGGAACTCTA CCTCGACGCT ATTATATACT GCCCACAOGA 6ACATTGCGG TCACACCATC 
TGACTACCAG GGGGGACCAG CCGCTTTACC TCGGATCCAA TACGGCTGTC AAGGTCCAGC 
GAGGTGAGAT CACGGGCCTA ACAAAGTCAA GGGCTGCAAA TCTAGTCAGG GACACTCTCG 
TTCTCCATCA GTGGTATAAA GTCCGTAAAG TTACCOATCC ACACTTGAAC ACCCTCATGG 
CACGCTTCTT ACTTGAGAAG GGGTACACAT CTGACCCTCG ACCTAGCATC CAGGGTGGGA 7080 
CCCTCAC6CA TCGTCTCCCA TCCCGCGGAG ACTCACGGCA 60G6CTTACT GGGTATGTAA 7140 
ATATACTAAG TACGTGCCTT CGATTCTCAA GTGATTATCT TCACTCTTTC TCGAAATCAT 7200 
CAGAC6ACTA TACAATCCAC TTTCAGCATG TATTCACATA CGGTTGCCTC TATGCTGATT 7260 
CGGTGATTAG ATCGGGCGGT GTTATTTCCA CTCCTTACCT TTTGAGTGCA AGTTGTAAAA 7320 
CATGCTTTGA GAAGATAGAC TCAGAGGAGT TCGTCCTGGC ATGTGAACCC CAATACAGGG 7380 
GTGCTGAGTG GCTGATATCA AAGCCAGTCA CTGTCCCTGA GCAGATAACT GATGCTGAAG 7440 
TCGAGTTTGA CCCCTOTGTG AGTGCOGGTT ATTGTCTCGG GATTCTCATT GGCAAGTCAT 7500 
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TCTTAGTTGA CATAAGGGCA AGTGGGCAT6 ATATCATGGA GCAGCGGACA TCGCCTAACC 7560 

TGGAGACGTT TTCTGTATCG GACATGCAGA AACTTCCGTG GAGTATTCTA ATTCGGTCTC 7620 

TCTGCAGATT CCTTATTGGC GCACCCCTCC TTCAGTTTGA GAAGGCTGGt CTCATTAGAA 7680 

TGCTGTATGC TGCGACAGGT CCAACCCCTA CCTTCCTAAT GAAAGTTTTT CAAGACTCAG 7740 

CCCTCCTCAT GGACTGCGCA CCCCTCGATC GGCTGTCCCC TACGATCAAC TTTCATAGTC 7800 

GGCCAGACCT CGTTGCT AAG CTTGTTTTAT TGCCCTTCAT CAACCCGGGT ATAGTGGAGA 7860 

TTGAAGTGTC TGGAATTAAT AGCAAGTACC ATGCAGTATC GGAGGCCAAT ATGGATCTGT 7920 

ACATCGCTGC TGCCAAGTCT GTGGGCGTGA AGCCCACACA GTTTGTTGAG GAAACAAACG 7980 

ACTTTACGGC CCGCGGCCAC CACCATGGTT GTTATTCCCT TTCTTGGTCT AAGTCACGCA 6040 

ATCAATCACA GGTCCTAAAG ATGGTAGTAC GGAAGCTGAA 6CTCTGTGTC CTGTATATAT 8100 

ACCCCACAGT CGATCCCGCC GTTCCTCTCC ACCTGTGCCA TCTACCAGCA TTAACTATAA 8160 

TCCTAGTGCT CGGCGGTGAC CCAGCGTACT ATGAGCGATT ACT T GAGA TG GACCTGTGCG 8220 

GGGCTGTGTC AAGTCGAGTC GATATCCCCC ATTCTCTGGC TGGCAGAACG CACAGGGGGT 8280 

TCGCAGTGGG CCCA6ACGCT GGTCCAGGTG TAATTAGACT CGACAGGTTA GACTCAGTTT 8340 

GTTATGCTCA CCCCTGTTTA GAGGAACTAG AGTTTAATGC ATATCTAGAC TCTGAGTTGG 8400 

TTGACATTAG TGATATGTGC TGCCTCCCCT TAGCGACACC CTGTAAGGCC CTTTTCAGGC 8460 

CAATATATCG GAGCTTACAG TCGTTCAGGT TAGCCTTAAT GGACAACTAT AGTTTTGTCA 8520 

TGGACCTCAT TATGATCCGA GGACTGGACA TTACCCCTCA CCTTGAGGAA TTTGACGAGC 8580 

TGCTTGTGGT AGGACAGCAC ATCCTCGGCC AGCCCGTCCT AGTAGAGGTT GTTTACTACG 8640 

TTGGAGTTGT TAGGAAGCGC CCTGTGTTAG CGAGGCATCC GTGGTCAGCA GATCTTAAGC 8700 

GAATTACTGT GGGGGGGCGG GCTCCCTGCC CCTCTGCTGC CAGATTGCGT GATGAGGATT 8760 

GTCAGGGGTC TCTGTTGGTT GGGCTTCCTG CTGGGTTGAC GCAGTTATTG ATAATTGATT 8820 

AAGATCAAGC CACCTACTAC CCTATTCTTA AAAAACCATA TGTCAGTGGT GCAGTGCTTG 6880 

GGCTTGGTTG TTGCTTT6TT GTAGCGCGTT 6910 



(21) INFORMATION FOR SEQ ID 110:20: 

(i> SEQUENCE CHARACTERISTICS: 
<A> LENGTH: 6 amino acids 
(B) TYPE: amino acid 

(if) MOLECULE TYPE: peptide 

(111) HYPOTHETICAL: NO 

(<V) ANT I -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:20: 

Met Ala Thr Arg Pro Ser Ser Leu 
1 5 

(22) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 



(ii) MOLECULE TYPE: peptide 
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(HI) HYPOTHETICAL: NO 
(iv) AMTI -SENSE: NO 

(X1) SEQUENCE DESCRIPTION: SEO ID N0:21: 

Asn Ala Leu Thr Gin Pro Val Asp Sin Leu Leu Lys 
1 5 

(23) INFORMATION FOR SEO ID NO:22: 

(f) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(it) MOLECULE TYPE: peptide 
(ill) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:22: 

Asp Gin Pro Thr Gly Arg Glu Gin 
1 5 

(24) INFORMATION FOR SEQ ID N0:Z3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 mi no acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANT I -SENSE: NO 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Vol Arg Gly Thr Leu 6ly Asp He 
1 5 

(25) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 
(Hi) HYPOTHETICAL: NO 

(iv) ANT I -SENSE: NO 

(xi) SEOUENCE DESCRIPTION: SEQ ID N0:24: 

Thr Ala Gin Arg Cys Asp His Ser 
1 5 

(26) INFORMATION FOR SEQ ID N0:25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 12 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
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(1v) ANTI-SENSE: NO 

(Xl) SEQUENCE DESCRIPTION: SEO ID NO: 25: 

Net GLu Thr Net Lye Leu Net Net Glu Lys Val Asp 

1 5 10 

(27) INFORMATION FOR SEO ID N0:26: 

(1) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: B amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEO ID NO: 26: 

Pro Net Leu Pro Ser His Pro Ala 
1 5 

(28) INFORMATION FOR SEO ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 
(iii> HYPOTHETICAL: NO 
(iv) ANT I* SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ 10 N0:27: 

Thr Ala Asp Glu Trp Asp lie lie 
1 5 

(29) INFORMATION FOR SEQ ID NO:28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 
(iv) ANTI -SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

Net Asn Ser Lys His Ser Tyr Val 
1 5 

(30) INFORMATION FOR SEO ID N0:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 
(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: HO 
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(xi) SEQUENCE DESCRIPTION: SEP ID NO: 29: 

Thr Leu Met Leu GLu Ue Asp Phe 
1 5 

(31) INFORMATION FOR SEO ID NO:30: 

(I) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12 amino acids 
(6) TYPE: ealno acid 

(ii) MOLECULE TYPE: peptide 

(Mi) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(xi) SEOUENCE DESCRIPTION: SEO ID NO: 30: 

Gly His Ser Leu Vel Asn He Tyr Phe 6ln lie Asp 
1 5 

(32) INFORMATION FOR SEO ID NO:31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 era* no acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 
(ill) HYPOTHETICAL: NO 

(iv) ANT 1- SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 

Tyr Lys Asp Pro He Aro Lys Tyr 
1 5 

(33) INFORMATION FOR SEQ ID NO: 32: 

<|> SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 

(B) TYPE: amino acid 

(ii) MOLECULE TYPE: peptide 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:32: 

Ala Phe Asn Val Phe Ser Tyr Arg 

1 5 

(34) INFORMATION FOR SEQ ID N0:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2658 amino acids 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: eDNA 

(iii) HYPOTHETICAL: NO 
(IV) ANT I -SENSE: NO 
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(xi) SEQUENCE DESCRIPTION: SEQ ID KO:33: 

CGTAGACCAC CTCCTGAAGG ACCTCAGGAA GAACCCCTCC ATGATCTCAG 50 

ACCCAGACCA GCGAACCGGA AGGGAGCAGC TATCGAATGA TGAGCTTATC 100 

AAGAAGCTAG TGACGGAGCT GGCCGAGAAT AGCATGATCG AGGCTGAGGA 150 

GGTGCGGGGC ACTCTTGGGG ACATCTCGCC TCGCATCGAG GCAGGGTTTG 200 

i 

AGTCCCTGTC CGCCCTCCAA GTGGAAACCA TCCAGACAGC TCAGCGGTGC 250 

I 

GACCACTCCG ATAGCATCAG AATCCTTGGC GAGAACATCA AGATACTGGA 300 

TCGCTCCATG AAGACAATGA TGGAGACAAT GAAGCTCATG ATGGAGAAGG 350 

TGGACCTCCT CTACGCATCA ACCGCCGTTG GGACCTCTGC ACCCATGTTG 400 

CCCTCCCATC CTGCACCTCC GCGCATTTAT CCCCAGCTCC CAAGTGCCCC 450 

GACAGCGGAT GAGTGGGACA TCATACCATA AAAAAATCGA ATCACCATGA 500 

ATTCAAAGCA TTCCTATGTG GAGCTCAAGG ACAAGGTAAT CGTCCCTGGA 550 

TGGCCCACAC TGAT6CTTGA 6ATAGACTTT GTAGGAGGGA CTTCACGGAA 600 

CCAGTTCCTT AACATCCCAT TTCTTTCAGT GAAAGAGCCT CTGCAGCTTC 650 

I 

CACGCGAGAA GAAGTTGACC GACTACTTCA CCATTGACGT AGAGCCAGCA 700 

GGTCATTCCC TGGTCAACAT ATACTTCCAG ATTGACGAC7 TCTTGCTCCT 750 

AACACTCAAC TCACTGTCCG TATACAAGGA CCCGATTAGG AAATACATGT 600 

TCCTACGCCT CAACAAGGAA CAGAGCAAGC ACGCAATTAA TGCAGCTTTC 850 

I 

AATGTCTTCT CTTATCCGCT TCGGAACATT GGTGTTGGCC CTCTC6GCCC 900 

AGACATTCGA TCTTCAGGGC CTTAGTTGCA ATACTGACTC CACTCCTGGA 950 

TTAATCGATC TGGAGATAAG GCGACTTTGC CACACCCCAA CGGAAAATGT 1000 

CATTTCATGC GAGGTTAGTT ATCTTAACCA CACGACTATT AGCCTCCCCG 1050 

CAGTCCACAC GTCATGCCTC AAGTACCACT GCAAAACCTA TTGGGCATTC 1100 

TTTGCTAGCT ACAGCGCTGA CCGAATCATC AATCGGTACA CTGGTACTCT 1150 

TAAGGGTTGT TTAAACAACT CAGCGCCAGA GGATCCCTTC GAGTGCAACT 1200 

GGTTCTACTG CTGCTCGGCG ATTACAACAG AGATCTGCCG ATGCTCTATT 1250 

ACAAATGTCA CGGTGGCTGT ACAGACATTC CCACCGTTCA TGTACTGCAG 1300 

TTTCGCGGAC T6TAGTACTG TGAGTCAGCA GGAGCTAGAG AGTGGAAAGG 1350 

CAATGCTGAG CGATGGCAGT ACCTTAACTT ATACCCCGTA TATCTTACAA 1400 

TCAGAAGTCG TGAACAAAAC CCTTAATGGG ACTATACTCT GCAACTCATC 1450 

CTCCAAGATA GTTTCCTTCG ATGAATTTAG GCGTTCATAC TCCCTAGCGA 1500 

ATGGTAGTTA CCAGAGCTCA TCAATCAATG TGACGTGTGT AAACTACACG 1550 

TCGTCCTCCC GGTCCAAGTT GAGAA6GCGG CGTAGGGATA CTCAACAGAT 1600 

TGAGTACCTA GTTCACAAGC TTAGGCCTAC ACTGAAAGAT GCGTGGGAGG 1650 

ACTGTGAGAT CCTCCAGTCT CTGCTCCTAG GtffiTGTTTKS TACTGGGATT 1700 

GCAAGTGCTT CGCAATTCTT GAGGGGCTCG CTCAACCACC CT GAT AT CAT 1750 

CGGGTATATA GTTAATGGAG TTGGGGTAGT CTGGCAATGC CATCGTGTTG 1600 
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ATGTCACCTT CATGGCCTCG AATGAGTCCA CATATTACCC TCCAGTAGAT 


109U 


TACAATGGAC GGAAGTACTT TCTGAATGAT GAGGGGAGGC TACAAACAAA 


4 an a 


CACCCCCGAG GCAAGGCCAG GGCTTAAGCG GGTCATGTGG TTCCGCA6CT 


•ncn 


ACTTCCTAG6 GACAGTA6GG TCTGCCGTGA AACCGAGGAC GATTC66TAC 


ZOUU 


AATAAGACCT CACATGATTA CCATCTACAG GAGTTTGACG CAAGTCTCAA 


•sac ft 
Z05U 


CATGACCCCC CAGACCAGTA TC6CCTCGGG TCATQAGACA GACCCCATAA 


94 Aft 


ATCATCCCTA CSGAACGCAG GCTCACCTCC TTCCATACAC CACCTCTAGT 


2150 


AATATAACCT CTACACATAC AG6CTCAGGC TGG6TGCACA TCGCCCTACC 


ZZOO 


CTCATTTGCT TTCCTCAATC CTCTC6GGTG 6CTTAGCGAC CTACTTGCCT 


2250 


GGGCGGCCTG GTTGGGTGGG 6TTCTATACT TAATAAGTCT TTGTGTTTCC 


2300 


TTACCAGCCT CCTTCCCGAG GAGGAGACGC CTCG6CCGGT GGCAGGAATA 


MCA 

Z350 


AACCGTACCG ACCAATCTCT TAAAAACCCT CTTCTCGGGA CAGAGGTCTC 


2400 


TTTCTGCCTT AAATCGAGTT CACTCCCCCA TCACGTACGA GCATTGGGCC 


2450 


AGATTAAAGC AAAGAACCTG GCATCCT6TG ACTATTACTT GCTATTCCGC 


2500 


CAAGTTGTAT TGCCCCCTGA AGTATATCCC ATTGGTGTCT TAATAAGAGC 


2550 


TGCGGAGGCC ATACTAACAG TTATAGTATC AGCTTGGAAG CTGGATCACA 


2600 


TGACAAAGAC CCTATACTCC TCTGTGAGAT ATGCACTCAC CAATCCCC6G 


2650 


GTCCGGGC 





ft 



(35) I M FORMATION FOR SEO 10 110:34: 

CO SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TTPE: nucleic acid 

(C) STRANDEDNESS: single 
(0) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cONA 

<ii!) HYPOTHETICAL: HO 

(iv) ANTI-SENSE: HO 

(xi) SEQUENCE DESCRIPTION: SEO ID HO:34: 

15 

CGCAATCAAT GCAGC 

(36) INFORMATION FOR SEO ID NO: 35: 

<i> SEQUENCE CHARACTERISTICS: 
(A> LENGTH: 16 bete pairs 
(8) TYPE: nucleic acid 

(C) STRANOEDNESS: single 

<D> TOPOLOGY: linesr 
(B) TYPE: anino scid 

(ii) MOLECULE TYPE: cDNA 

(Hi) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

( xi) SEQUENCE DESCRIPTION: SEO ID N0:35: 

Ii 

TTCCTGCCAC CGGCCG 
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We claim: 

1. A nucleotide sequence selected from the group 
consisting of: 

a) SEQ ID NOs. 5, 7, 9, 19, 33; and their 
5 complementary nucleotide sequences; 

b) nucleotide sequences capable of 
hybridizing to SEQ ID NOs. 5, 7, 9, 19, 33 and their 
complementary nucleotide sequences, wherein said 
nucleotide sequence is at least 10 nucleotides in length 

10 and is not SEQ ID NOs. 1 or 3; and 

c) nucleotide sequence fragments selected 
from the group consisting of: fragments of SEQ ID NOs. 
5, 7, and 9; nucleotide number 1 through 53, and 
nucleotide number 1880 through 8910 of SEQ ID NO 19; 

15 complementary nucleotide sequences of the foregoing; and 
nucleotide sequences capable of hybridizing to the 
foregoing; wherein each of said fragment encodes a 
protein comprising at least four amino acids or at least 
one immunogenic epitope. 

20 2. An amino acid sequence selected from the group 

consisting of: SEQ ID NOs 6, 8, 10, 20 to 32; amino acid 
position 1 to amino acid position 102 of p23; amino acid 
position 37 to amino acid position 201 of p23; amino acid 
position 102 to amino acid position 201 of p23; amino 

25 acid position 154 to amino acid position 201 of p23; 
amino acid position 1 to amino acid position 70 of 
unglycosylated gpl8; amino acid position 23 to amino acid 
position 142 of unglycosylated gp!8; amino acid position 
63 to amino acid position 142 of unglycosylated gpl8; 

30 amino acid position 100 to amino acid position 142 of 
unglycosylated gp!8; SEQ ID Nos. 20 to 32; a fragment of 
a full length amino acid sequence of p23, said fragment 
is selected from the group consisting of: amino acid 
position 1 to amino acid position 102 of p23, amino acid 

35 position 37 to amino acid position 201 of p23, amino acid 
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position 102 to amino acid position 201 of P 23, amino 
acid position 154 to amino acid position 201 of p23; a 
fragment of a full length amino acid sequence of gpl8, 
said fragment is selected from the group consisting of: 
5 amino acid position 1 to amino acid position 70 of gpl8. 
amino acid position 23 to amino acid position 142 of 
gplB. amino acid position 63 to amino acid position 142 
of gpl8, amino acid position 100 to amino acid position 
142 of gpl8; fragments of the foregoing comprising at 
10 least four amino acids or at least one immunogenic 
epitope; variants and equivalent amino acid sequences of 
the foregoing. 

3. a substantially pure composition of protein 
gpl8, p57, or pol. 

15 4. A composition comprising: recplS, recp57, 

recpol, a recombinant ly produced Boma disease virus, or 
a recombinants produced virus encoded by the nucleotide 
sequence group consisting of: SEQ ID NOs. 5, 7, 9, 19, 
33; and their complementary nucleotide sequences. 

20 5 . a recombinant cell, virus, or vector, 

containing a nucleotide sequence of claim l, or a 
nucleotide sequence encoding the amino acid sequence of 
claims 2, 3, or 4. 



25 



6 An infective but replication deficient viral 
vector produced by removing or mutating at least one gene 
responsible for replication from a nucleotide sequence 
selected from the group consisting of SEQ ID No. 19, and 
a nucleotide sequence complementary or hybridizable to 
SEQ ID No. 19. 
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7. A method of delivering a desired gene or 
desired chemical into a host, comprising the step of 
administering into the host the viral vector of claim 6, 
or transplanting into the host a cell containing the 

5 viral vector of claim 6, said viral vector further 
comprising the desired gene, or a gene encoding the 
desired chemical. 

8. A method for detecting, in a sample, a ligand 
capable of binding one or more proteins selected from the 

10 group consisting of: gpl8, recpie, p57, recp57, pol, 
recpol, proteins having the amino acid sequences of claim 
2, and immunogenic fragments thereof, said method 
comprising the steps of: 

a) contacting the sample with one or more of 
15 the proteins; and 

b) determining binding of the ligand to the 

proteins . 

9. The method of claim 8, wherein the method is an 
immunoassay method, the ligand is an antibody 

20 immunoreactive with Borna disease virus, the proteins are 
prebound to a solid support without gel size 
fractionation . 

10. A method for selecting for a ligand capable of 
binding one or more protein (s) selected from the group 

25 consisting of: gpl8; recpl8; p57; recp57; pol; recpol; 
proteins having the amino acid sequences of claim 2; and 
immunogenic fragments of the foregoing; said method 
comprising the steps of: 

a) contacting the ligand with the protein (s) ; 

30 and 

b) selecting for the ligand which binds to 
the protein (s) . 
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11. The method of claim 10, wherein the ligand is 
a therapeutic agent against a disease selected from the 
group consisting of: (1) BDV infection or related 
pathogenesis; and (2) neurologic and neuropsychiatry 
5 disease not due to BDV infection, said method further 

comprising the steps of: 

a) treating patients suffering from the 
disease, with the selected ligand and assaying for 
efficacy of the selected ligand for combating the 

10 disease; and 

b) selecting the ligand which is efficacious 

against the disease, said ligand serving as the 
therapeutic agent. 



15 



20 



12. A ligand capable of binding a protein selected 
from the group consisting: gplB, P 57, recplS, recp57, 
pol, recpol, proteins having the amino acid sequences of 
claim 2, and fragments thereof which contain at least one 
immunogenic epitope. 

13 . An assay panel comprising one or more proteins 
selected from the group selected from the group 
consisting of: gp!8; recplS; P 57; recp57; pol; recpol; 
proteins having the amino acid sequences of claim 2; and 
immunogenic fragments of the foregoing. 

14 . a method for detecting a nucleotide sequence in 

25 a sample comprising the steps of: 

(a) contacting the sample to a test nucleotide 
sequence selected from the group consisting of: 
nucleotide sequences encoding genome of Borna disease 
virus, gplB. P 57, and pol; fragments of genome of Boma 

30 disease virus which are at least 10 nucleotides in length 
and between nucleotide number 1 through 53, and 
nucleotide number 1880 through 8910 of SEQ ID NO 19; 
fragments of nucleotide sequences encoding gplB, p57, 
pol which are at least 10 nucleotides in length; and 
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nucleotide sequences which are complementary or 
hybridizable to any of the foregoing nucleotide 
sequences ; 

(b) incubating the sample with the test 
5 nucleotide sequence under hybridization condition; and 

(c) detecting hybridization of any nucleotide 
sequence in the sample with the test nucleotide sequence. 

15. A method for treating a patient suffering from: 
(1) BDV infection or related pathogenesis; or (2) 

10 neurologic and neuropsychiatric disease not due to BDV 
infection, said method comprising the steps of 
administering to the patient a therapeutic amount of the 
therapeutic agent of claim 11. 

16. A vaccine comprising gpl 8, recpl8, p57, recp57, 
15 pol, recpol, p23, recp23, p40, recp40, recombinantly 

produced Borna disease virus, the proteins of claim 2, 
fragments of the foregoing, or their encoding nucleotide 
sequences, capable of provoking cellular or humoral 
immune response in an organism administered with the 
20 vaccine. 
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FIG. 2(1) 

1 GTTOGTTMCMOIA^^ MO 

HPPKRRLVDDADAHED 

101 TMAMTCTATATGAACCCCCAG^ 200 
Q D L T E P P A S L P K I P G K F I Q T T ¥ 6 6 S D P H P 6 I 6 H 

201 GAGAWGACWCAGGCA6AA^ 300 
EKDIRQHAVALIDQSRRDHFHTVTPSLVFLCLLI 

301 TCCCAGGACTGCACGCTttCT^^ 400 
PGIHAAFVHGGVPRESYLSTPVTRGEQTVVKTA 

401 GAAGMACGGGGAAAAGWGACGCAGCGTGATCTCACCGAGCTGGAGA 500 
KFYGEKTTQRDLTEIEISSIFSHCCSLIIGVVI 

501 6^TCGTC6TCTAi^ 600 

601 CACTACTCUMT6mAATC^^ m 
L L 0 H F H P H E A I D « I H G Q P H V 6 S F V I S L L T T D F E 

701 GTimAGGTAAAGAAmATGGACCAGAnM m 
SP6KEFHDQ1KIYASYAQHTTYTTIKEYIAECH 

801 GATGCTAIXOTICAATMAGT^ 900 
D A T I T 1 P V V A Y E I R D F I E V S A I L R E D H A D I F P F L 

901 TSBGBGCttnAWE«XCGIW m 
G A I R H P D A I R L A P R S F P N I A S A A F Y U S It K E R P T 

1001 AATGGCA^ACCI^ 1100 

1101 6GGGCAGAGCTCTCAGGTWGATCTCTGCWTAATGAAGATGATAGGTCT 1200 
6 A E I S 6 E I S A I H R H I G V T 6 I H * 

1201 MDUUlttricWOIttTra 1300 

HATRPSSIVD 

1301 CTCCaGWGttCGA^ 1400 
1401 GACCAGCTCCT^ 1500 
1501 AGTWGTGAC^ 1600 

1601 CCTGTCCGCCCTCCAACTGWAA^ 1700 
ISALQVETIQTA0RCDHSDS1RIIGEHIK1LDR 

1701 TCCATBAABACAATGATGGAfiACAW 1800 
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FIG. 2(2) 

1801 CCCATCCTGMItfCCGimmAYCtt^ uw 
H P A P P R 1 T P 0 I P S A P T T D E V D I I P * HNS 

1901 AAAACAncfrATGTGGAGnCA^ 2000 
K N S T V E L K D K T I V P 6 V P T 1 H L E I D F V 6 fi T S R N Q 

2001 nCCnAACATCCCATTTdnCASTBAAAfiAfiCCTCTfiWBCnCCACk 2100 
FLNIPFISVKEPLQLPREKKLT0KFTIDVEPA6H 

2101 ATCCCTGGTCMTATATACTTC^^^ 2200 
S I ¥ N I y F 0 I 0 0 F I L I T L H S L S ¥ Y I 0 P I R K T K F L 

2201 ACGCCYCAAMAGIiACCAGAGCM^ 2300 
R I H K 0 Q S K H A I N A A F R ¥ F S ¥ R L R H I G ¥ 6 P L 6 P 0 

H 0 P S H S F L I G F G T L ¥ L ¥ L S A R T 

2301 ATTCttTCTTCAGGGCtTT^ 2400 
\ \ \ S Q 6 G L S C H T 0 S T P G I 1 D L E I R R L C H T P T E R ¥ 1 

2401 TCATGIMGTTAGnATCTCAACCA^ 2500 
S C E ¥ S ¥ I R B T 1 1 S I P A ¥ H T S C I I ¥ H C K 1 ¥ V G F F G 

2501 GyAGCTAtoK&Cy6ACC6WyCATAAAyC^ 2600 
S ¥ S A 0 R I I N R ¥ ¥ G y ¥ K G C L R H S A P E D P F E C H V F 

2601 CYACYGCY6CTCGGCGAnACAACA^ 2700 
y c c s a i y y E I C R C S 1 y H v y ¥ A ¥ Q y F P P F H ¥ C S F 

2701 GCAMCYGCA6YACCGT6AkCAACAGM^ 2800 
A 0 C S y ¥ S Q Q E L E S G K A H L S 0 G S y L y Y y P ¥ I I 0 S E 

2801 AAGyCCT6AA(^CCnUU^^ 2900 
¥ ¥ R K y L N G y I L C R S S S K I ¥ S F D E F R R S Y S L y N G 

2901 YAGffiACCA6^YCA7CAAYCAATGTGACGTG7 3000 
S ¥ Q S S S I R ¥ y C A I Y Y S S C R P R L t R R R R 0 y Q Q I E 

3001 TATCTKnOCMOT^^ 3100 
Y L ¥ H I L R P Y L I 0 A N E D C E I L Q S L L L G ¥ F G T 6 I A S 

3101 GYGCYYCYCAAYjm^ m 

3201 CAC6nCAYffiC6YGGAAYWGYCCA£CYAnACCCY^ 3300 
T F H A H R E S Y Y ¥ P P ¥ 0 Y H G R K Y F I R 0 E G R L Q Y R Y 

3301 CCCGAGGCAAGGCCAGGGCnAAGCGGGYttYGyGGYYCkcAGGyACYy 3400 
P E A R P G L I R ¥ H V F G R ¥ F L G Y ¥ G S G ¥ It P R R 1 R Y R I 

3401 MaCAtiYMCYACCACCTGMGGA^ 3500 
Y S H D Y H L E E F E A S L R H Y P 0 Y S 1 A S G B E Y 0 P I H H 

3501 TIOT/ySGMttCATO 3600 
A Y G Y Q A 0 L L P ¥ Y R S S H I Y 5 Y 0 Y G S G V ¥ H 1 G L P S 
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FIG. 2(3) 

3601 TTTttnTttTDynOCte 3700 
F A F L I P I 6 H I R 0 I L A ¥ A A H L 6 G ¥ I Y L I S L C V S L P 

3701 CAGCCTCCTTCfieGAGGA6^ 3800 
ASFARRRRLGRHOE* 

3801 TWTMKra^ 3900 
3901 nfiTAnttCCCaWAM 4000 

4001 GAAGACCCTATACTCCTCTGTM^ 41011 
KTLYSSYRYAITHPRVRAOLELHIAYQRIVGQY 

4101 TCCT*C*BttS86MBCttiiw 4200 
S Y S R E A D I S P K R L fi I H S I Q F I Q S t V I A T I 0 T T S C 

4201 6CCTMTWCCTACAACCACTncn(OT^ 4308 
L H T Y I H F L A A A 0 T A K S R C H I L I A S V V Q 6 A t V E Q 

4301 AfiSGTCATTTCnWT^ 4400 
G S F L D H I I H H 1 0 1 1 0 S I H I P H 0 0 Y F T I I K S I F P 

4401 TMraMfiKTTOT^^ 4500 
YSQGlYHGRHHVSVSSOFASVFAIPElCPQlOSl 

4501 TAaAA^CTGTC^ 4600 

4601 ACGGGAGCAGUCCACAAGilTGCGTGTCGAGCTGGAAACk^ m 
R t 0 L H K H R V E L E T P 0 A I L S Y G H T L L S I F R A E F 1 

4701 AAAGGCTATGTCTCAAAGAATfr^ 4800 
K6YYSKRAKHPPVHIIPGC0KSIKNARELGRMSP 

4801 CG&CATnGjcCGACW^ 4900 
A F 0 R R ¥ 0 I F E K V » 1 I R 1 A 0 I 0 H 0 P 0 F H 0 I Y S 0 K 

4901 GGCGATMTCAGCTCAAGA^ 5000 
AMSSRROHVFEYHAAAFHKKYGERLERPPARi 

5001 GGACCGTWCGAmGTGAATGCTCTMTCWTGGACOT^ 5100 
G P S R I V R A I 1 0 G R L 0 H I P A L I E P F Y R G A Y E F E D R 

5101 GGnGACTGTGCTCGTGcdAAGWGAAAWAAA™ 5200 
L T V L Y P K E R E I K Y K G R F F 5 It Q T 1 A I R 1 Y 0 V V A t 

5201 AGCTGMCTTAA6AATGAGOTAT6CCATAttTAAAGAWWCTCAA^ 5300 
AAIKREVHPYIKTHSHTHSSTAITHIINRISHI 

5301 flOTOIIIMI^^ 5400 
I T R 6 0 S F V 1 N L 0 Y S S H C H G F R P E I 0 A P I C R Q I 0 0 
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FIG. 2(4) 

5401 AGATGTTCAATOKOT^^^ 5500 
H F H C 6 Y F F R T 6 C T L P C F T T F I I Q D R F N P P Y S L S 

5501 TGCTMGCCCOT^ 5600 

5601 MfiATMnkTCTTCfiGGM 5708 
EI IALREIHVTFHIL6QGDH0TII IHKSASQHNO 

5701 ttCTAHMtttttW^ 5800 
L L A E R A L G A L Y K H A R L A 6 H H I K V E E C V V S D C L Y 

5801 T6A6TA7G6AM6AAGCTTT7CnCC67ffl^ 5900 
EY6KKLFFRGVPVP6CLKQISRVTDSY6EI.FPN 

5901 CTATACTOMrnittCn^ 6000 
L Y S K L A C L T S S C I S A A H A 0 T S P I V A L A T G V C L Y L 

6001 nATCSAGTTATATfiTTGAGCTGCCTCCA6CAATCATGCA66AT6ASTC6CTAnSAC^ 6100 
1 E L Y » E L P P A 1 H Q 0 E S t L T T I C L Y G P S 1 G G I P Y 

6101 CCCYGCAACCCYACCCAGTGTCTMCAGAGGAAYGTCCGACCCACYGttCTnCAG^ 6200 
PAYLPSVFFRGHSDPLPFQLAILQ1L1KY1GVY 

6201 YGJAGCn(OT6AAYC61GlWcAA6nMG6AlA 6300 
C S L ¥ N R V V K I R 1 A P Y P D V L S L Y Y 0 P Y S L II I A Q Y Y 

6301 ACCGGCCAGAACG1CAGA1UGGAG6T 6400 
R P E R Q 1 R R » I E E A I A Y S S H S S R I A T F F Q 0 P L Y E 

6401 ftTGGCYCAOTGCnGCGAGG6ACCT7T^ 6500 
H A 0 L L A R 0 L S Y H H P I R P R 0 H S A L F A L S I V A Y G L 

6501 ironWAHTOATT^^ 6600 
SI10LFQKSSYVYSAS0AYHIEDVAIESYRYKES 

6601 CYAYCATCCAGGGTCYGTTAGACACCACY^ 6700 
I 1 Q G L L 0 Y Y E G Y H H 0 P Y I E 6 C T Y I A A K 0 L R R L 7 

6701 A7GGGG7CGA6ACCYAGnffiAG7CACAATGCC6ffl 6800 
7T R D L V G Y Y H P F Y A E 0 F H P H S S V G A K A E I Y I 0 A 

6801 AnATATACTGCCCACAGGAttCATO^ 6900 
I 1 Y C P 0 E Y L R S H H L Y T R G 0 0 P L Y I G S H T A Y K V Q R 

6901 GAGGTGAGAYCACGGGCCYA^GYOAGGGaGI^ 7000 
6 E 1 7 G I Y K S R A A I I Y R D T 1 Y I H 0 V Y K V R I Y Y 0 P 

7001 ACACT76AAtoCCC7CA7GttACGCnC77ACn6AGAA^ 7100 
HLRYIHARFILEKGYTSDARPS1QGGY17HRLP 

7101 TttMGGAWYCACGGCAG^ 7200 
SRGDSROGLYGYYHILSYHlRFSSOYlHSfSRSS 
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FIG. 2(5) 

7201 CAMCGACTATACAATCCAOT m 
D D Y T I H F 0 H ¥ F T Y G C I Y A 0 S Y 1 R S 6 6 Y I S T P Y L 

7301 mGACTG^CTAMAT^^ MM 

7401 M^cicreraaBto «oo 

KP¥T¥PE01T0AEYEFDPC¥SAGYC161L1GKSF 
7501 TCTTAOTfiACATAAS66CAAGT6B6MTWTCT WO 

7601 6A6TAneYkC6GTCTXTCTI^ 7700 
5IY1RSLHRFL1BARLL0FEKASLIRHLYAAT8 

7701 CCAACCCCTAfiCTTCCTAATGAAASTTTTTCMGACTCAffiCCTCCTCAT 7B00 
t T P S F L H K ¥ F 0 D S A L L H 0 C A P I 0 R L S P R I H F H S R 

7801 GGGM(iACCTC6n(nAAOT^ 

6 D L ¥ A I L ¥ L L P F I B P G I ¥ E I E ¥ S G 1 H S K Y H A ¥ S 

7901 GttGG^TATI^^ 8000 

8001 cACCATGGTT6nATTCCCmOTGm^^ WOO 
HHGCYSISHSKSRNQSOYLKHYYRKIKICYIYIY 

8101 ACCCCACAGTKATCCCGCCCTTGCTaCWCaGTGCMTCTA^ 8200 
P T ¥ 0 P A ¥ A I D L C H L P A L T I I I ¥ I G 6 0 P A Y Y E R L 

8201 AOT6AGATGWCTGTGaGG6CTM^^^ 8300 
L E M D L C G A ¥ S S R ¥ D I P H S L A G R T H R G F A ¥ G P 0 A 

8301 GGTCCAGGTCTAAnAGACTCGACAGGro MOO 
G P G ¥ I R L 0 R L E S ¥ C Y A H P C I E E L E F I A Y L 0 S E L ¥ 

8401 nWCAnACTGATATGTGCTGCCTCCCCTW 8500 
D 1 S 0 H C C I P L A T P C K A I F R P I Y R S t 0 S F R I A L H 

8501 GGAI^aATAGMGTCATttACaCATTATGATCCGAGG^ 8600 
0 R Y S F ¥ N 0 L 1 H I R 6 L D 1 R P H L E E F 0 E L L ¥ ¥ 6 Q H 

8601 ATcaCGmttM 8700 

8701 GAAHACTCTG1)G(iGGtt6HiCTCCCTG^ 8800 
1 T ¥ G G R A P C P S A A R I R 0 E 0 C Q G S L L ¥ G I P A 6 L T 

8801 (5CAGnATTttTAATJ6AriAA6ATCAAtt^ 8900 
Q L I I 1 0 * 

8901 GTAGCGCGTT 
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FIG. 3a 



p40 p23 gpl8 (p57) (pl80) 




FIG. 3b 
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TRANSCRIPTS: 
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8.9 
7.1 
6.1 





^-3.5 


- - 


«*-2.8 




<*- 1.9 




«*- 1.5 




♦ 1.2 
«+- .75 



1.2 



.9 
75 



F/G. 6b 



■7.1 
•6.1 



^ PROBES: — c T C*E* 



p40 p23 9pl8. ... r„iofti 
( 0 0 0 tP 57) X 0 (plBO) 



v tl/T2 T3 t4 T5 t6 T7 

p40 5'-GUcU AAa cua aaa^aaC AAU 6AA caa acc aau aaaaaa CCA AAU 6CG (T2) 

P23 5--CBAC AAC S6A U6A AUG BfiA CAU cau acc aua_aaaaaa UC6 AAU CAC (T3) 

gpl8 5'-UUAG CUG CAA UACUGA CUC CAC UCC UGG ACU GAU USA CCU 

P57 5'-GGCA GGA AUA AAC CGU ACC GAC cag Ucu cuu aaaaa CCC UCU CCU (t4) 

5'-CUGA AUU AUG CCC GCA ACU AGA cag cuu acu aaaaaaACUG CUC CAA (T5) 

5 ' -UACA CCU GCU CCC AGG CUG UGA caa auc cau aaaaaa UGC GAG AGA U6) 

PlBO 5'-UUM GAU CAA 6CC ACC UAC UAC ecu auu cuu aaaaaa CCA UAU GUC (T7) 

F/G. 6c 
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POLYMERASE NUCLEOTIDE SEQUENCE FIG. 18(1) 

1-2 

AT6TCATTTCATGCGAG\ /CCTCCTTCGCGA6GAGGAGACGCCTCGGCCGGTGGCAGGA 

ooqo + u +- — + + + — 3/4.3 

TACAGTAAAGTAC6CTC GGAGGAA6CGCTCCTCCTCTGCGGAGCCGGCCACCGTCCT 

ATAAACCGTACCGACCAGTCTCTTAAAAACCCTCTCCTCG6AACAGAGGTCTCTTTCTGC 

•3744 + + +. + + + — 380 3 

TATTTGGCAT6GCTGGTCAGAGAATTTTTGGGAGAGGAGCCTTGTCTCCAGAGAAAGACG 

CTTAAGTCGAGCTCACTCCCCCATCATGTACGAGCACTAGGCCAGATTAAAGCAAGGAAC 

■»ar\A + + + + + — - + Job J 

GAATTCAGCTCGAGTGAG6GGGTAGTACATGCTCGTGATCCGGTCTAATTTCGTTCCTTG 

CTG6CATCCTGTGACTATTACTTGCTATTCCGCCAAGTTGTATTGCCCCCTGAAGTATAT 

"iOCA .....+- --+ H — — +...- +.... -+... y$c£ 

GACCGTAGGACACTGATAATGAACGATAAGGCGGTTCAACATAACGGGGGACTTCATATA 



CCCATTGGTGTTCTAATAAGAGCTGCGGAGGCTATACTAACAGTTATAGTATCAGCTTGG 
GGGTAACCACAAGATTm 



AAGCTGGATCATATGACGAAGACCCTATACTCCTCTGTGAGATATGCACTCACCAATCCC 

•jofU + + + + + +--- 4043 

TTCGACCTAGTATACTGCTTCTGGGATATGAGGAGACACTCTATACGTGAGTGGTTA6GG 

CGG6TCCGAGCCCAACTTGAGCTTCACATTGCCTACCAGCGCATAGTGGGTCA6GTCTCG 

A044 + + + + + + — 4103 

GCCCAGGCTCGGGTTGAACTC6AAGTGTAACGGATGGTCGCGTATCACCCAGTCCAGAGC 

TACAGCCGGGAGGCAGACATAGGGCCAAAAAGGCTTGGGAATATGTCATTGCAATTCATC 

4104 + + + + + +--- 4163 

ATGTCGGCCCTCCGTCTGTATCCCGGTTTTTCCGAACCCTTATACAGTAACGTTAAGTAG 

CAATCTCTCGTTATTGCCACCATAGACACGACAAGCTGCCTAATGACCTACAACCACTTT 

4164 + + + + + +--- 4ZZ3 

GTTAGAGAGCAATAACGGTGGTATCTGTGCTGTTCGACGGATTACTGGATGTTGGTGAAA 

CTTGCTGCA6CAGACACAGCCAAGAGCAGATGCCATCTCCTAATCGCCTCAGTGGTCCAG 
4994 + + + ..+ + + 4ZB3 

GAACGACGTCGTCTGT6TCGGTTCTCGTCTACGGTAGAGGATTAGCGGAGTCACCAGGTC 

GGGGCCCTTTGGGAACAAGGGTCATTTCTTGATCATATAATCAACATGATCGACATAATT 

4904 .+ +. +- + — + — 4343 

CCCCGGGAAACCCTTGTTCCCAGTAAAGAACTAGTATATTAGTTGTACTAGCTGTATTAA 

GACTCAATCAACCTCCCCCATGATGATTACTTCACAATTATTAAGTCTATCTTTCCCTAC 

4044 +.. + + + + + — 4403 

CTGAGTTAGTTGGAGGGGGTACTACTAATGAAGTGTTAATAATTCAGATAGAAAGGGATG 

TCCCAAGGGCTTGTTATGGGGAG6CATAATGTATCAGTCTCCTCTGATTTCGCGTCC6TA 

4404 + +- + + + +-- - 44b J 

AGGGTTCCCGAACAATACCCCTCCGTATTACATAGTCAGAGGAGACTAAAGCGCAGGCAT 

TTTGCCATTCCT6AATTATGCCCGCAACTAGACAGCTTACTAAAAAAACTGCTCCAACTT 

44g4 .+ +-- +- + +-- + — hoc. j 

AAACGGTAAGGACTTAATACGGGCGTTGATCTGTC6AATGATTTTTTTGACGAGGTTGAA 

GACCCCGTTCTCCTCCTCATGGTCTCTTCGGTGCAGAAGTCATGGTACTTCCCTGAGATC 

AC. OA + + + + + + 4 boo 

CTGGGGCAAGAGGAGGAGTACCAGAGAAGCCACGTCTTCAGTACCATGAAGGGACTCTAG 
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FIG. 18(2) 

CGAAT66TCGACGGGTCACGS6AGCA6CTCCACAAGATGCGT6TCGAGCT66AAACGCCC 

4504 + + + + + .-.+-.- 4643 

GCTTACCAGCTGCCCAGTGCCCTCGTCGAGGTGTTCTACGCACAGCTCGACCTTTGCGGG 

CAA6CCCTGCTGTCGTACGGCCATACCCTCCTGTCAATATTTCGGGCAGAGTTTATCAAA 

4<544 + + + + + +--- 47 03 

GTTCGGGACGACAGCATGCCGGTATGGGAGGACAGTTATAAAGCCCGTCTCAAATAGTTT 

6GCTATGTCTCAAAGAATGC6AA6TGGCCGCCCGTACACCTGCTCCCAGGCTGTGACAAA 

4704 + + +-- + + + — 4/63 

CCGATACAGAGTTTCTTACGCTTCACCGGCGGGCATGTGGACGAG6GTCCGACACTGTTT 

TCCATAAAAAAT6CGAGAGAGCTGGGCCGCTGGAGCCCGGCATTTGACCGACGATGGCAG 

4764 + + + + + + — 48Z3 

AGGTATTTTTTACGCTCTCTCGACCCGGCGACCTCGGGCCGTAAACTGGCTGCTACCGTC 

CTCTTCGAGAAG6TTGTCATTCTAAGAATTGCTGACCTAGATATGGATCCCGACTTCAAC 

4824 + + + + + +--- 4883 

GAGAAGCTCTTCCAACAGTAAGATTCTTAACGACTGGATCTATACCTAGGGCTGAAGTTG 

GATATTGTTAGCGATAAGGCGATAATCAGCTCAAGAAGGGACTGGGTATTCGAGTACAAT 

4884 + + + + + +--- 4943 

CTATAACAATCGCTATTCCGCTATTAGTCGAGTTCTTCCCTGACCCATAAGCTCATGTTA 

6CAGCGGCCTTTTGGAAGAAATACGGTGAACGGTTGGAGAGGCCTCCTGCCAGGTCGGGA 

4044 + + + + + +--- 5003 

CGTCGCCGGAAAACCTTCTTTAT6CCACTTGCCAACCTCTCCG6AGGACGGTCCAGCCCT 

CCGTCACGACTTGTGAATGCTCTAATC6ATGGACGCTTAGACAATATCCCAGCCCTGCTA 

5004 + + — + + + + — 5063 

GGCAGTGCTGAACACTTACGAGATTAGCTACCTGCGAATCTGTTATAGGGTCGGGACGAT 

GAGCCATTTTACAGGGGAGC6GTTGAGTTCGAGGATCGGTTGACTGTGCTCGTGCCTAAG 

5064 + + --+ + + + — 51Z3 

CTCGGTAAAATGTCCCCTCGCCAACTCAAGCTCCTAGCCAACTGACACGAGCACGGATTC 

GAGAAAGAGTTAAAGGTAAAGGGAAGGTTCTTCTCGAAGCAAACATTGGCAATCAGGATA 

5124 + + + + + + — 5183 

CTCTTTCTCAATTTCCATTTCCCTTCCAAGAAGAGCTTCGTTTGTAACCGTTAGTCCTAT 

TATCAGGTT6TT6CTGAAGCTGCACTTAAGAATGAGGTTATGCCATACCTAAAGACACAC 

5184 + + --+ + + + — 5Z43 

ATAGTCCAACAACGACTTCGACGT6AATTCTTACTCCAATACGGTAT6GATTTCTGTGTG 

TCAAT6ACCATGAGCTCAACG6CTCTAACTCACCTTCTTAACCGGCTATCACATACTATC 

c?44 . + + — + + + + — 5303 

AGTTACTGGTACTCGAGTTGCCGAGATTGAGTGGAAGAATTGGCCGATAGTGTATGATAG 

ACTAAGGGTGACTCCTTTGTTATTAACCTTGACTATAGTTCCTGGTGCAACGGTTTCCGA 

c-»04 + + + + + +-- • 5363 

TGATTCCCACTGAGGAAACAATAATTGGAACTGATATCAAGGACCACGTTGCCAAAGGCT 

CCAGAACTGCAGGCCCCAATCTGTCGTCAGTTGGATCA6ATGTTCAATTGCGGGTACTTC 

coc/ + + + + + + — 64Z3 

GGTCTTGACGTCC6GG6TTAGACAGCAGTCAACCTAGTCTACAA6TTAACGCCCATGAAG 

TTCA6GACTGGGTGCACACTGCCATGCTTTACCACGTTTATTATTCAAGACAGGTTCAAC 

5424 + + + + + +--• 5483 

AAGTCCTGACCCACGTGTGACGGTACGAAATGGTGCAAATAATAAGTTCTGTCCAAGTTG 

CCGCCCTATTCCCTCAGTGGTGAGCCCGTTGAAGAC6GAGTTACATGCGCGGTTGGGACT 

5484 + + + + + + '-- 5643 

6GCGGGATAAGGGAGTCACCACTCGGGCAACTTCTGCCTCAATGTACGCGCCAACCCTGA 
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FIG. 18(3) 

AAAACAATGGGGGA6GGCATGAGGCAGAAACTATGGACAATCCTTACGAGCTGCTGGGAG 

5544 + + + + +--- +--- 5603 

TTTTGTTACCCCCTCCCGTACTCCGTCTTTGATACCTGTTAGGAATGCTCGACGACCCTC 

ATAATTGCTCTTCGGGAAATTAACGTGACGTTTAACATACTAGGCCAA6GTGATAATCAG 

5604 + + + + + +-- - 5663 

TATTAACGAGAAGCCCTTTAATTGCACTGCAAATTGTATGATCCGGTTCCACTATTAGTC 

ACAATCATCATACATAAATCTGCAAGCCAAAATAACCAGCTATTAGCGGAGCGAGCACTA 

5664 + + + + + +--- 5723 

TGTTAGTAGTATGTATTTAGACGTTCGGTTTTATTG6TCGATAATCGCCTCGCTCGTGAT 

GGGGCCCTGTACAAGCATGCTAGATTAGCTGGCCATAACCTCAAGGTAGAGGAATGCTGG 

5724 + + + + + +--- 5783 

CCCCGGGACAT6TTCGTACGATCTAATCGACCGGTATT6GAGTTCCATCTCCTTACGACC 

GT6TCAGATTGTCTGTATGAGTATGGAAAGAAGCTTTTCTTCCGTGGT6TACCTGTCCCG 

5784 + + + + + +--- 5843 

CACAGTCTAACAGACATACTCATACCTTTCTTCGAAAAGAAGGCACCACATGGACAG6GC 

GGCTGTTTGAAGCAGCTCTCACGGGTGACGGATTCTACTGGAGAGCTATTCCCAAACCTA 

5844 + + + + + .----+--- 5903 

CCGACAAACTTCGTCGAGAGTGCCCACTGCCTAAGATGACCTCTCGATAAGGGTTTGGAT 

TACTCAAAGTTAGCCTGCTTAACATCATC6TGTTTAAGCGCAGCGATGGCAGACACATCT 

5904 + + + + + + — 5963 

ATGAGTTTCAATCGGAC6AATTGTAGTAGCACAAATTCGCGTCGCTACCGTCTGT6TAGA 

CCAT6G6TGGCACTCGCGACAGGT6TCTGTCTGTATCTTATCGAGTTATATGTTGAGCTG 

5964 + + + + + +•-- 6023 

GGTACCCACCGT6AGCGCTGTCCACAGACAGACATAGAATAGCTCAATATACAACTCGAC 

CCTCCAGCAATCATGCAGGATGAGTCGCTATTGACGACCCTCTGCCTCGTAGGCCCATCC 

6024 + + + + + .-...+--. 60B3 

GGAGGTCGTTA6TACGTCCTACTCAGCGATAACTGCTGGGAGACG6AGCATCCGGGTAGG 

ATTGGT6GGCTTCCGACCCCTGCAACCCTACCCAGTGTCTTTTTCAGAGGAAT6TCCGAC 

6084 + + + + + +--- 6143 

TAACCACCCGAAGGCTGGGGACGTTGGGATGGGTCACAGAAAAAGTCTCCTTACAGGCTG 

CCACTGCCCTTTCAGCTAGCACTCTTGCA6ACCCTCATTAAGACGACAGGGGTGACCT6T 

6144 + + + + + +--- 6203 

GGTGACGGGAAAGTC6ATCGTGAGAACGTCTGGGAGTAATTCTGCTGTCCCCACTG6ACA 

AGCTTGGTGAATCGTGTGGTCAAGTTACG6ATAGCACCCTATCCAGACTGGCTCTCTCTA 

6704 + + + + + +--- 6263 

TCGAACCACTTAGCACACCAGTTCAATGCCTATC6TGGGATAGGTCTGACCGAGAGAGAT 

GTGACTGACCCGACCTCACTCAACATTGCCCAAGTGTACCGGCCAGAACGTCAGATCAGG 

6264 + + + + + + ""' »3Z3 

CACTGACTGGGCTGGAGTGAGTTGTAACGGGTTCACAT6GCCGGTCTTGCAGTCTAGTCC 

AGGTGGATTGAGGAA6CGATAGCGACAAGCTCACACTCGTCACGCATAGCAACTTTCTTC 

6324 + + + + + +--- 6383 

TCCACCTAACTCCTTCGCTATCGCTGTTCGAGTGTGAGCAGTGCGTATCGTTGAAAGAAG 

CAGCAGCCCCTCACGGAGAT6GCTCAGTTGCTTGC6AGGGACCTTTCAACAATGATGCCT 

6384 + + + + + +--- 6443 

GTCGTCGGGGAGTGCCTCTACCGAGTCAACGAACGCTCCCTGGAAAGTTGTTACTAC6GA 

CTTCGACCCCGGGATATGTC6GCCTTATTCGCATTATCAAATGTCGCATACGGTTTAAGC 

6444 + + + + + +--- 6503 

GAAGCTG6GGCCCTATACAGCCGGAATAAGCGTAATAGTTTACAGCGTATGCCAAATTCG 
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FIG. 18(4) 

ATTATAGATCTATTTCAAAAATCCTCTACCGTTGTTTCTGCAAGTCAAGCT6TCCATATC ece ^ 

TAATATCTAGATAAAGTTTTTAGGAGATGGCAACAAAGAC6TTCAGTTCGACAGGTATAG 

GAGGAT6TTGCCCTAGAGAGTGTAAGGTATAAGGAATCTATCATCCAGGGTCTGTTAGAC 

6564 + + + + + +--- 6523 

CTCCTACAACGGGATCTCTCACATTCCATATTCCTTAGATAGTAGGTCCCAGACAATCTG 

ACCACTGAGGGGTATAACATGCAACCTTATTTGGAAGGTTGCACTTACCtTGCAGCCAAA 

6624 + + + + + +--- 6683 

TGGT6ACTCCCCATATTGTACGTTGGAATAAACCTTCCAACGTGAAT6GAACGTCGGTTT 

CAGTTACGTAGGTTGACATGGGGTCGA6ACCTAGTT6GAGTCACAAT6CCGTTTGTTGCC 

6684 + + + + + - 6743 

GTCAATGCATCCAACTGTACCCCAGCTCTGGATCAACCTCAGTGTTACGGCAAACAACGG 

6AGCAATTCCATCCTCACAGTTCTGTGGGTGCAAAGGCGGAACTCTACCTCGACGCTATT 

6744 + + + + + +--- 6803 

CTCGTTAAGGTAGGAGTGTCAAGACACCCACGTTTCCGCCTTGAGATG6AGCTGCGATAA 

ATATACTGCCCACAGGAGACATTGCGGTCACACCATCTGACTACCAGGGGGGACCAGCCG eoe ^ 

6804 + + + + + --+--- 6863 

TATATGAC6GGTGTCCTCTGTAACGCCAGT6TGGTAGACTGATG6TCCCCCCTGGTCGGC 

CTTTACCTCGGATCCAATACGGCTGTCAAGGTCCAGCGAGGTGA6ATCAC6GGCCTAACA 

6864 + + + + + + -" 6923 

GAAATGGAGCCTAGGTTATGCCGACA6TTCCAGGTCGCTCCACTCTAGTGCCCGGATTGT 

AAGTCAAGGGCTGCAAATCTAGTCAGGGACACTCTCGTTCTCCATCAGTGGTATAAAGTC 

6924 + + + + + +--- 6983 

TTCAGTTCCCGACGTTTAGATCA6TCCCTGTGAGAGCAAGAGGTAGTCACCATATTTCAG 

CGTAAAGTTACCGATCCACACTTGAACACCCTCATG6CACGCTTCTTACTTGAGAAGGGG 

6984 + + + + + ----+--- 7043 

GCATTTCAATGGCTAGGTGTGAACTTGTGGGA6TACCGTGCGAAGAATGAACTCTTCCCC 

TACACATCTGACGCTC6ACCTAGCATCCA6GGT6GGACCCTCACGCATCGTCTCCCATCC 

7044 + + + + + +--- 7103 

ATGTGTAGACTGC6A6CTGGATCGTAGGTCCCACCCTGGGAGTGC6TAGCAGAGGGTAGG 

C6CGGAGACTCACGGCAGGGGCTTACTGGGTATGTAAATATACTAAGTACGTGGCTTCGA 

7104 + + + + + +--- 7163 

GCGCCTCTGAGTGCCGTCCCCGAAT6ACCCATACATTTATATGATTCATGCACCGAAGCT 

TTCTCAAGTGATTATCTTCACTCTTTCTCGAAATCATCAGACGACTATACAATCCACTTT 

7164 + + + + + +--- 72Z3 

AAGAGTTCACTAATA6AAGTGAGAAAGAGCTTTA6TAGTCTGCTGATATGTTAGGTGAAA 

CAGCAT6TATTCACATACGGTTGCCTCTATGCTGATTC6GTGATTAGATCGGGCGGTGTT 

7224 + + + + + +--- 7283 

GTCGTACATAAGTGTATGCCAACGGAGATACGACTAAGCCACTAATCTAGCCCGCCACAA 

ATTTCCACTCCTTACCTTTTGAGTGCAAGTTGTAAAACATGCTTTGAGAAGATAGACTCA 

7284 + + + + + +--- 7343 

TAAAGGTGAGGAATGGAAAACTCACGTTCAACATTTTGTACGAAACTCTTCTATCTGAGT 

GAGGAGTTCGTCCTGGCATGT6AACCCCAATACAGGGGTGCTGAGTGGCTGATATCAAAG 

7344 + + + + + +-- 7403 

CTCCTCAAGCAGGACCGTACACTTGGGGTTATGTCCCCACGACTCACCGACTATAGTTTC 

CCAGTCACTGTCCCTGAGCAGATAACTGATGCTGAAGTCGAGTTTGACCCCTGTGTGAGT 

7404 + + + + + +--- 7463 

G6TCAGTGACAGGGACTCGTCTATTGACTACGACTTCAGCTCAAACTGGGGACACACTCA 
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FIG. 18(5) 

GC6fiGTTATTGTCTC6G6ATTCTCATTGSCAAGTCATTCTTAGTT6ACATAA66GCAAGT 

7 4 C A + -+- -+- ------- -+ / OCO 

C6CCCAATAACAGAGCCCTAAGAGTAACCGTTCAGTAAGAATCAACTGTATTCCCGTTCA 
G6GCATGATATCATGGA6CAGCGGACATGGGCTAACCTGGAGAGGTTTTCTGTATCGGAC 

+ •--+ + + + + /30J 

CCCGTACTATA6TACCTCGTCGCCTGTACCCGATTGGACCTCTCCAAAAGACATAGCCTG 
AT6CAGAAACnCCGTGGAGTAnGTAAnCGGTCTCTCTGGAGATTCCTTATTGGCGCA ^ 

7584 TACGTCTTT6AAGGCACCTCATM 

CGGCTCCnCAGTnGA6AAGGCTG6CCTCAnAGAAT6CTGTATGCT6CWM jm 

7644 GCCGAGGAAGTCAAACTCTTCCGACCG^ 

ACCCCTAGCTTCCTAAT6AAAGTTTTTCAAGACTCAGCCCTCCTCATGGACTGCGCACCC 

7 -in A + + +- h + 1 10S 

TGGGGATCGAA6GATTACTTTCAAAAAGTTCTGAGTCGGGAGGAGTACCTGACGCGTGGG 
CTC6ATCGGCTGTCCCCTAGGATCAACTTTCATAGTCGGGGAGACCTCGTTGCTAAGCTT 

71 C A + _ 4 + -+ h + 

GAGCTAGCCGACAGGGGATCCTAGTTGAAAGTATCAGCCCCTCTGGAGCAACGATTCGAA 

GTTTTATTGCCCTTCATCAACCCGGGTATAGTGGAGATTGAAGTGTCTGGAATTAATAGC 

7004 + + + + + + — /Boo 

CAAAATAACGGGAAGTAGTTGGGCCCATATCACCTCTAACTTCACA6ACCTTAATTATCG 

AAGTACCATGCAGTATCGGAGGCCAATATGGATCTGTACATCGCTGCTGCCAAGTCTGTG 

7004 + + +- -+ + + — 

TTCATGGTACGTCATAGCCTCCGGTTATACCTAGACATGTAGCGACGACGGTTCAGACAC 

GGCGTGAAGCCCACACAGTTTGTTGAGGAAACAAACGACTTTACGGCCCGCGGCCACCAC 

iqaa + + + + + + — auui 

CC6CACTTCGGGTGTGTCAAACAACTCCTTTGTTTGCTGAAATGCCGGGCGCCGGTGGTG 

CAT6GTTGTTATTCCCTTTCTTGGTCTAAGTCACGCAATCAATCACAGGTCCTAAAGATG 

onnA +...- + — + + + ---+ — oUbo 

6TACCAACAATAAGGGAAAGAACCAGATTCAGTGCGTTAGTTAGT6TCCAGGATTTCTAC 

GTAGTACGGAAGCTGAA6CTCTGTGTCCTGTATATATACCCCACAGTCGATCCCGCCGTT 

QftfLA +__.-.- + + .--+ OICO 

CATCATGCCTTCGACTTCGAGACACAGGACATATATATGGGGTGTCAGCTAGGGCGGCAA 
GCTCTCGACCTGTGCCATCTACCAGCAnAACTATAATCCTAGTGCTCGGCGGTGACCCA gi83 

8124 CGAGAGCTGGACACGGT^ 

GCGTACTATGA6CGATTACTTGAGATGGACCTGTGCGGGGCTGTGTCAAGTCGAGK ^ 

8184 CGCATGATACTCGCTAATGAACTCTAC^ 

ATCCCCCATTCTCTGGCTGGCAGAACGCACAGGGGGTTCGCAGTG66CCCAGACGCTGGT 

00^ . -+-- -+-- -.--+-- --+- + ojUj 

TAGGGGGTAAGAGACCGACCGTCTTGCGTGTCCCCCAA6C6TCACCC6G6TCTGCGACCA 

CCA6GTGTAATTAGACTCGACAGGTTAGAGTCAGTTTGTTATGCTCACCCCTGTTTAGAG 

poftA - ._+ + + + + + — oood 

GGTCCACATTAATCT6AGCTGTCCAATCTCAGTCAAACAATACGAGTGGGGACAAATCTC 

GAACTAGAGTTTAATGCATATCTAGACTCTGAGTTGGTTGACATTAGTGATATGTGCTGC 

B"iCA + + + + + + o4£o 

CTTGATCTCAAATTACGTATAGATCTGAGACTCAACCAACTGTAATCACTATACACGACG 
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FIG. 18(6) 



8424 
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8544 
8604 
8664 
8724 
8784 



CTCCCCnA6C6ACACCCTGTAA6GCCCTmCAG6CCAATATATCG6A6CTTACA6TCG ^ 
GAGGGGAATCGCTGT6GGACAfrCck 

TTCAGGTTAGCCnAATGGACAACTATAGTTnGTCATGGACCTCATTATGATCCGA^ ^ 
AAGTCCAATCGGAATTACCTGTTGATA^ 

CTGGACATTAGGCCTCACCnGAGGAATTTGACGAGCTGCTTGTGGTAGGACAGCACAT 
GACCTGTAAT CCGGAGTGGAAa 

CTCGGCCAGCCCGTCCTAGTAGAGGnGTnACTACGTTGGAGTTGTTAGGAAGCGCCCT ^ 
GAGCCGGTCGGGCAGGATCATCK 



GGGACGGGGAGACGACbbTUAAlblAL IALILLI AALAu I UA 

CTTCCTGCTGGGTTGACGCAGTTATTGATAATTGATTA 

+ + + — +- 8BZ1 

GAA6GACGACCCAACT6CGTCAATAACTATTAACTAAT 
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FIG. 19(1) 

1 GGTA6ACCAG CTCCTGAAGA ACCTCAGGAA GAACCCCTCC ATGATCTCAG 

51 ACCCAGACCA 6CGAACCGGA AGGGAGCAGC TATCGAATGA TGAGCTTATC 

101 AAGAAGCTAG TGACGGAGCT GGCCGAGAAT AGCATGATCG AGGCTGAGGA 

151 GGTGCGGGGC ACTCTTGGGG ACATCTCGGC TCGCATCGAG GCAGGGTTTG 

201 AGTCCCTGTC CGCCCTCCAA GTGGAAACCA TCCAGACAGC TCAGCGGTGC 

251 GACCACTCCG ATAGCATCAG AATCCTTGGC GAGAACATCA AGATACTGGA 

301 TCGCTCCATG AAGACAATGA TGGAGACAAT GAAGCTCATG ATGGAGAAGG 

351 TGGACCTCCT CTACGCATCA ACCGCCGTTG GGACCTCTGC ACCCATGTTG 

401 CCCTCCCATC CTGCACCTCC GC6CATTTAT CCCCAGCTCC CAAGTGCCCC 

451 6ACAGCGGAT GAGTGGGACA TCATACCATA AAAAAATCGA ATCACCATGA 

501 ATTCAAAGCA TTCCTATGTG GAGCTCAAGG ACAAGGTAAT CGTCCCT6GA 

551 TGGCCCACAC TGATGCTTGA GATAGACTTT GTAGGAGGGA CTTCACGGAA 

601 CCAGTTCCTT AACATCCCAT TTCTTTCAGT GAAAGA6CCT CTGCAGCTTC 

651 CACGCGAGAA GAAGTTGACC GACTACTTCA CCATTGACGT AGAGCCAGCA 

701 GGTCATTCCC TGGTCAACAT ATACTTCCAG ATTGACGACT TCTTGCTCCT 

751 AACACTCAAC TCACTGTCCG TATACAAGGA CCCGATTAGG AAATACATGT 

801 TCCTACGCCT CAACAAGGAA CAGAGCAAGC ACGCAATTAA TGCAGCTTTC 

851 AATGTCTTCT CTTATC66CT TCGGAACATT GGTGTTGGCC CTCTCGGCCC 

901 AGACATTCGA TCTTCAGGGC CTTAGTTGCA ATACTGACTC CACTCCTGGA 

951 TTAATCGATC TGGAGATAAG GCGACTTTGA CACACCCCAA CGGAAAATGT 

1001 CATTTCATGC GAGGTTAGTT ATCTTAACCA CACGACTATT AGCCTCCCGG 

1051 CAGTCCACAC GTCATGCCTC AAGTACCACT GCAAAACCTA TTGGGGATTC 

1101 TTTGGTAGCT ACAGCGCTGA CCGAATCATC AATCGGTACA CTGGTACTGT 

1151 TAAGGGTTGT TTAAACAACT CAGCGCCAGA GGATCCCTTC GAGTGCAACT 

1201 GGTTCTACTG CTGCTCGGCG ATTACAACAG AGATCTGCCG ATGCTCTATT 

1251 ACAAATGTCA CGGTGGCTGT ACAGACATTC CCACCGTTCA TGTACTGCAG 

1301 TTTCGCGGAC TGTAGTACTG TGAGTCAGCA GGAGCTAGAG AGTGGAAAGG 

1351 CAATGCTGAG CGATGGCAGT ACCTTAACTT ATACCCCGTA TATCTTACAA 

1401 TCAGAAGTCG TGAACAAAAC CCTTAATGGG ACTATACTCT GCAACTCATC 

1451 CTCCAAGATA GTTTCCTTCG ATGAATTTAG GCGTTCATAC TCCCTAGCGA 
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FIG. 19(2) 
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TCAATCAATG TGACGTGTGT AAACTACACG 
GAGAAGGCGG CGTAGGGATA CTCAACAGAT 
TTAGGCCTAC ACTGAAAGAT GCGTGGGAGG 
CTGCTCCTAG GGGTGTTTGG TACTGGGATT 
GAGGGGCTGG CTCAACCACC CTGATATCAT 
TTGGGGTAGT CTGGCAATGC CATCGTGTTG 
AATGAGTCCA CATATTACCC TCCAGTAGAT 
TCTGAATGAT GAGGGGAGGC TACAAACAAA 
GGCTTAAGCG GGTCATGTGG TTCGGCAGGT 
TCTGGGGTGA AACCGAGGAG GATTCGGTAC 
CCATCTAGAG GAGTTTGAGG CAAGTCTCAA 
TCGCCTCGGG TCATGAGACA GACCCCATAA 
GCTGACCTCC TTCCATACAC CAGGTCTAGT 
AGGCTCAGGC TGGGTGCACA TCGGCCTACC 
CTCTCGGGTG GCTTAGGGAC CTACTTGCGT 
GTTCTATACT TAATAAGTCT TTGTGTTTCC 
GAGGAGACGC CTCGGCCGGT GGCAGGAATA 
TAAAAACCCT CTTCTCGGGA CAGAGGTCTC 
CACTCCCCCA TCACGTACGA GCATTGGGCC 
GCATCCTGTG ACTATTACTT GCTATTCCGC 
AGTATATCCC ATTGGTGTCT TAATAAGAGC 
TTATAGTATC AGCTTGGAAG CTGGATCACA 
TCTGTGAGAT ATGCACTCAC CAATCCCCGG 
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FIG. 25 
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